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PREFACE 


The Second All-Union Conference on Theoretical and Applied Mechanics 
was held in Moscow from 29 January to 5 February 1964, under the joint 
auspices of the National Committee of the USSR for Theoretical and Applied 
Mechanics, the Institute of Mechanics of the Academy of Sciences of the 
USSR, and the Moscow State University im. M.V. Lomonosov. 

The work of the conference was directed by an organizing committee 
consisting of: I. J. Artobolevskii, N. Kh. Arutyunyan, G.I. Barenblatt, 
L.A.Galin, A.L.Gol'denveizer, G. Yu. Dzhanelidze, A.A. Dorodnitsyn, 

A. Yu.Ishlinskii, S.V. Kalinin, L. M.Kachanov, M.V. Keldysh, P.Ya. Kochina, 
L.G. Loitsyanskii, A.I. Lur'e—vice-chairman, Yu.A. Mitropol'skii, 

G.K. Mikhailov—scientific secretary, N.N. Moiseev, N.I. Muskhelishvili— 
chairman, A.A. Nikol'skii—vice-chairman, D. E. Okhotsimskii, G.I. Petrov, 
I. M. Rabinovich, Yu. N. Rabotnov—vice-chairman, V.V.Rumyantsev, 

L.I. Sedov—first vice-chairman, V.V.Sokolovskii, G.G. Chernyi, 
D.I.Sherman. Technical secretary of the organizing committee — 

A.J]. Derzhavina. 

The work of the conference proceeded in three sections. 

Section I. General and applied mechanics (chairman—A. Yu. Ishlinskii, 
vice-chairmen—A.JI.Lur'e, D. E. Okhotsimskii, scientific secretary — 
V.V.Rumyantsev). Section I comprised the subsections: 

1. Analytical mechanics and the theory of stability of motion; 
I-2. Celestial mechanics; 
I-3. Oscillations and control; 
I-4. Gyroscopy; 
I-5. Theory of mechanisms and machines; 
I-6. Problems in teaching of mechanics; 

Section I. Fluid mechanics (chairman-—L.I. Sedov, vice-chairmen— 
L.G. Loitsyanskii and G.G. Chernyi, scientific secretary—N.N. Moiseev). 
Section II comprised the subsections: 

II-1. General hydromechanics; 

II-2. Aerodynamics and gas dynamics; 

II-3. Theory of plasma and rarefied gases; 

II-4. Motion of viscous fluids, boundary layers, turbulence and heat 
transfer; 

II-5. Hydrodynamics of multi-component systems; 

II-6. Applied fluid dynamics. 

Section III. Mechanics of solids (chairman — N.I. Muskhelishvili, 
vice-chairmen—A. L. Gol'denveizer and Yu. N. Rabotnov, scientific 
secretary—G.I. Barenblatt). Section II comprised the subsections: 

III-1. Theory of elasticity; 

IlII-2. Theory of plasticity; 

IlI-3. Theory of plates and shells; 


Ili-4. Rheology and theory of creep; 

III-5. Constructional mechanics; 

II-6. Soil mechanics; 

Over 2000 members and 3000 guests from all the principal scientific 
and industrial centers of the country participated in the conference. A 
total of 560 papers were read in 125 sections and subsection meetings. 

Of a total of over 100 survey papers specially commissioned by the 
organizing committee, those submitted to the organizing committee were 
included in the prepared ''Proceedings" of the conference. 

The proceedings of the conference are being published in three 
parts:* 

1. Analytical mechanics. Stability of motion. Celestial ballistics. 


2. Oscillations. Gyroscopy. Theory of mechanisms. Fluid mechanics. 
3. Mechanics of solids. 


* [The present volume is a translation of Part 1 of the Proceedings. } 
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ANALYTICAL MECHANICS. 
STABILITY OF MOTION 


I. S. Arzhanykh 


DEVELOPMENT OF THE THEORY OF CANONICAL 
EQUATIONS OF ANALYTICAL MECHANICS 


Hamiltonian canonical systems have some remarkable properties of 
relevance for integration problems. In this connection attempts have been 
made to extend certain theorems of dynamics to arbitrary systems of 
ordinary differential equations 


dx 
r= Xalbe tts Meee tn) R12... A. (1) 


These investigations fall into three categories: 1) a method is indicated 
for reducing system (1) to a Hamiltonian system (this is the approach of 
Liouville, Lie and Koenigs), 2) integration theorems for Hamiltonian 
systems are extended to system (1), without reducing it to any specific 
‘canonical’ form (this method is applied to Poisson's Theorem in § 2): 
3) some new universal canonical structure is proposed for system (1), 
generalizing the Hamiltonian canonical structure, and having properties 
which also generalize those of Hamiltonian systems (this approach is 
described in §§ 3—S5). 


§ 1. The methods of Liouville and Lie-Koenigs 


Along with the variables x, we introduce the variables y, and form the 
function 


H = D) yaXalt, Xt). 00s Sn): 
=i 


Equations (1) can then be written in the form 
Xa = OH /Oys; 
if the new variables are determined by the equations 
Ya = —0H/Oxs, (2) 


we obtain a Hamiltonian system. This is Liouville's line of reasoning. We 
note that system (2) is the conjugate with respect to the variational equations, 
i.e., the equations define the coefficients of the linear integral invariant 


y 2 YaOXy. (3) 
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Lie and Koenigs /1/ start from this point. Let system (1) have the integral 
invariant (3). We reduce it to canonical form 


1D pdqn, 2m Sn (3") 


&,=1 


and regard the variables p,, and qs, as unknowns. Then, introducing other 
new variables 4), uy,..-.,Una-n-,, we obtain a Hamiltonian system 


Qe, = 9Hi/Opn, Pa, = — OH /O9e,, } (4) 
Ay = Ay, (¢, 1, ooo qa,» Pr, coe Pa,) 
and a supplementary system 
Us = U, (t, Jas ee +> Yayo Pare >os Pry» ry oo oy Un-sn,)- (1') 


Integrating (4) and substituting qs, and ps, in (1'), we obtain the system 


u,=V, (2, ta, -- +» Un-gn,)> 


(5) 


s=1,2,...,n—2n, 


to which we apply the same transformations. We obtain a chain of Hamil- 
tonian canonical systems. 

The disadvantage of Liouville's method is that it introduces many super- 
fluous (redundant) unknowns. The Lie-Koenigs method avoids this, but 
requires knowledge of the integral invariant (3), i.e., we must first find 
the solutions of the Liouville system (2) as functions of ¢ and x,.. These 
methods are therefore only of general theoretical interest. 


§ 2. Poisson's theorem 


The relevance of Poisson's theorem for Hamiltonian canonical equations 
is quite obvious from the following considerations. If @ = a= const is an 
integral, then the variational equations have the solutions 


bq,= ed9/Op,, 5p, = —edq/dq,, & =const. (6) 


Consequently, if we have a second integral »= 6, then the integral of the 


expanded system will be 6=ec. Therefore @,) =c is also an integral of 
the Hamiltonian system. 


For an arbitrary system (1) Poisson's theorem is replaced by Boole's 
theorem: if there exists an integral 


a 
Dy On (ts Xa. Xn) Ya = 
kw] 
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for the Liouville system and F = const is an integral of system (1), then 
another integral of this system is 


a 
OF 
O--— = O. (7 
Py Ox, ) 


The Liouville system, as we know, is related to the linear integral 
invariants. It is therefore natural to try to generalize Boole's theorem 
(and thereby also Poisson's theorem) on the basis of the theory of integral 
invariants. The first investigations of this type are due to Poincaré /2/. 
We shall describe a more direct approach. 

Suppose system (1) admits m linear integral invariants 


= { Due, p=1,2,..., m. (8) 


We denote by [x, x... x,, ) the determinant 
D y() 

uP a) 6. ud 

ul ulm)... um) 


and set up the skew-symmetric form 


(il §veaTa) = DD noe) Remy (tite), (9) 


Va S VaGewe <vy, 


where 4,,,...., i8 a skew-symmetric tensor satisfying the following system 
of equations: 


m 

: ax 

The = > >> Fag peat pet (1 0) 
p=] haw) 


Then, as is easily seen, another integral is 
{il,...Jm} = const. (11) 
If system (1) has the integrals F,=c,, then it is well-known that 
uv) = OF /Ox, (12) 
and, consequently, we have the integral 


IF... Sin, Sy = const (13) 


<<... <I aie 0 (4,5, to 0 Sy 
In particular, for the Hamiltonian system 
Xy = OH /OxXasv, Xaovw = — 0H/dx, 


the components of the tensor x are Ky...4,n4n...0¢4=1, and all the others 
arezero. Therefore, if this system hasthe integrals F,=o¢Q@=1, 2,..., 2) 
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then the following will also be an integral: 


O(FiFaFs... Fes) 
menciccey 0 lho: Ryinsa ence = const. (14) 
This is Laurent's generalization of Poisson's theorem. 

The formulation of the Poisson-Laurent theorem is thus based on the 
fact that a particular solution of system (10) can be found in explicit form. 
In some cases the structure of the components of the tensor x for the 
general system (1) can be indicated. For example, if we have the integrals 
G=o(A=21,2,...,a—m), then 


8 0(G:G,G.... Ga-m) 
Thyy...0m = Te O (x1... Sy Xu gg- ++ Xa) . (1 5) 


where M is the last Jacobi multiplier, and 


8 = (—! yr 


As an example we mention the Poisson equation system describing the 


rotation of a heavy rigid body about a fixed point (three classical integrals, 
M =1). 


§ 3. Canonical equations of rank higher than zero 


Let us assume that the system in question is of even order 2n, and write 
it in the form 


qe = Q(t, Ji» oe ey Gay Pay soos Pn) : 


Po = Po(t, Gas +-+ Gur Pay ++ +» Pa) ne) 
We set up the linear form 
w (8) = 3 (aap, — Pq.) (17) 
and reduce it to canonical form 
w (6) = 6H + F,6H,. (18) 
emi 


Consequently, our system is represented in canonical form of rank 7 /3/: 


oa OH Sp OMe 
Qe ap, + 2s Fo 3 
(19) 


- OH oS gp OM, 
Po — aD Fe ae 
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The canonical potentials H, H, and the factors F, may be modified by 
transformations of the form 


FAH, + Fada +... = 8 (Fil + Falla t+...) — HAF: —ABFsy— .... 


An invariant of any such transformation is the quantity 


fey a ee (20) 


As an example of the reduction of equations of motion to canonical form 
of rank higher than zero, we consider a holonomic system subjected to 
nonconservative forces Q, (f,9,,..-.4). We represent the elementary work 
in canonical form 


D049, = dU + D V,0u,. (21) 


We then obtain a canonical system of rank r, where 


H=—T—U+ Dipg, Fo=Ve, Hp=U,. 


Mechanical systems moving under the action of forces R, (t,q,...,4a> 
P1,.--, Pa), for example, nonholonomic, are easily reduced to canonical 
form of rank an, where 


H=—L + 9} paot Das Fi, =0,. H, = —R,. (22) 


The equations of the theory of perturbations in a nonconservative force 
field also have canonical form of order n(see /4/, p. 564). 

For systems (19) Poisson's theorem has a distinctive formulation: if 
@ =a and »= 6 are two integrals of the system, then 


: A,, ® H,, 
@tIZey, Ewe ne es 


From the standpoint of the theory of the momentum field /5/, i.e. , assuming 
that 


Pe = pPr(t, Jin eee Gn)» (24) 


the second group of equations of (19) may be represented in the form 


op, a(H) 3 (H) Op, Op, - 
at + aq, + QF, oq, + (age — sat) = O- a2) 


This is the rotational structure of canonical equations of rank higher than 
zero. The rotational form of the equations is convenient when generalizing 
the Hamilton-Jacobi method. 
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Consider, for example, involution systems, i.e., systems (19) whose 
principal and supplementary canonical potentials do not depend explicitly 
on time and are in involution (H 4H,) = 0. (H,,, Ho.) = 0, (p, p} Pf: = 1,2,...,7). 
Such systems possess integrals 


H=h, H,=h,, (26) 


and for their integration, as follows from equations (25), it is sufficient to 
determine the total integral w(q,,q5,.... qa, Ay...» Ae, Cy... Cary) from 
the combined system 


H(q1.--+» Qa, Ow/dq:, ..., 0w/0gn) =h 
h, 


H, (91, 209 Jar Ow/dq,, oo, Gw/0qgn) — (27) 


The following will then be integrals of the system (19) 
Py = Ow/0q,, Cw/Ih=t—to, Bdw/dg =d\ (X= 1,2,...,.2—r—l]), (28) 


and to obtain the remaining integrals we must integrate the system 


aan, (p = 1,2,..., 1). (29) 


§ 4. Invariance under contact transformations 


One of the essential properties of canonical systems of rank zero 
(Hamiltonian systems) is their invariance under contact transformations /1/ 
(see also /6/, p. 174). Canonical systems of rank higher than zero possess 
the same property. 


We carry out the transformation (p, g) — (n, §) according to the formulas 


8 ao, 
p= a +O Ae a 
om) ¥ 
Ge, (30) 
—W = 3 +B Me oe 
Wo (tf, Jis-- +r Inv bios + +s Sa) = O 


System (19) is then transformed into a similar system 


: (31) 


where 


ow, 
Z=H4+E4DA yaa O=F,, Z,=,. (32) 


The proof of this theorem is the same as that of the analogous theorem for 
Hamiltonian systems /7/. 
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The invariance property may be used for extending to system (19) a 
method similar to the Hamilton-Jacobi method. First we consider an 
involution system. We carry out the transformation 
Pp, = 0w/0qv, — Nv = Ow/Oky (33) 
and determine the function w from the equations 


H=ta, H,=€s-» (9=1,2,....7). (34) 


But then we obtain 


az oz, ez oz, © 
v=1,....9%; p=l,....a—]; Ael,...,.a—r—l 
az oz, 0, if pip 
=], = = . 
Ca Sn-w l, if Pr = p. (36) 
Therefore gE. = 0, nm = 0, tr = —), ia» = — F,, or 
d ow 
&. = const, m = const, 5- =f — te, at 5, Fo (37) 


For the general canonical system (19) we carry out the transformation 
(30) and chose s so that the system of partial differential equations 


Ow 
H+ a+ DA. a =0 
F,,=0, H,=0 (pips) , (38) 
ow, ow, / OH oH 
Br + Dag, (ap, + WF Hee) = 0 


where p, are replaced by their values from (30), is consistent. We then 
obtain the integrals § = const, n. = const. The number s is called the genus 
of the momentum field. One of the important problems of the theory of 
integration of canonical equations of rank higher than zero consists in the 
preliminary calculation of the genus of the momentum field. For involution 
systems the genus is zero. 


§ 5. Relation of the integration problem to the theory 
of Lie groups 


The following kinetic system of rank r corresponds to the canonical 
system (19): 


OL OL 
a (a + BK age) = tq t BK na (39) 
where 
Lma—H+ qn. K,=F,, L,=—H,, (40) 
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and the momenta p, must be expressed in terms of 9, using the first group 
of equations of (19). 


Many mechanical systems are easily represented in the form (39). 
Consider, for example, the equations of motion 


d oT oT 
“at “Gq. —~ OG, = Qv(bs av - + +5 In): (41) 


They can be written in the form (39) by setting r=, 
L=T, K,=Q, Le=Q. (42) 
The equations of motion of nonholonomic systems 
dt 9g. Oa, = D MyAv (43) 
also have the form (39), where K, = » M,As,. L, = q, (p = I.n). One often 
encounters chain systems ° 
— —— — — =), y=1,2,..., A 
@ 9G, — 29, = % Vg =, +1,..., (44) 


«ee eo © @ © & © © © @ @ © © @ @ @ @© &© @ @ @ 


=-— oC OC SS = 0, Va= Mg-, + 1,..., A, 


where Lia = Lo (t, 91 Ce} Gn,» Ja,ris oes 8 qa, 43, oe es Gas Ganyors evceys Ga)- They 
may also be reduced to the form of kinetic equations of rank higher than 
zero, by representing the linear form 


a au ' auia 
3 B35, 84,4 34,, 84 ) (45) 


fei % 


in canonical form 
6L + >) K,OL,. 
o 


The existence of integrals of (39) which are linear in p,, is closely related 
to the theory of Lie groups. 
Suppose the infinitesimal transformations of the group are 


Og. = euy(t, gir... 9a) (Vel, 2,..., A) (46) 


and suppose the following equality is satisfied: 


OL + 5) K,OL, = 0. (47) 
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Then we have the integral 


Su (2 


of +3K; 


x x =) = const. (48) 


In fact, let us write equality (47) in explicit form 


Di (+ BK 2) O%. + (FE + BK Gt) On] = 0. 


v 


ial a (5 +2K% sa) 8a + 
+3(% + ikea a + 3K, 52) ] 0.0. (49) 


But by virtue of the equations of motion (39) the second sum vanishes. 
Equality (47) may be satisfied due to the invariance of the principal and 
supplementary kinetic potentials (0L = 0,0l, =0), or due to the equalities 


OL=0, D)K,OL, = 0. (50) 


If we apply this case to the system (41), we see that if the kinetic energy 
is invariant with respect to the Lie group, and the work of the generalized 
forces on the displacements q, is zero, then the system admits an integral 
which is linear in the momenta. 

Applied to the nonholonomic system (43), we have the following sufficient 
conditions for the existence of a linear integral: 


OL=0, SAndg.=0, (uml, 2,..., m). (51) 


A mixed equation system 


d /@R aR, 
S (E+E) RDS Gets 


. ORe 
dnea = 3p + Se Hee (52) 


ia Pa a 
where 


R (t, qi» cece Jar 42, eee, a Punsis oo ce Pa) = 
ma —L+ 2 Pmerdeers R,= —L,, S, = K,, 


OL 
= K, ——* 5 
Pmer oe pa ° Omen ( 3) 
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also corresponds to a canonical system of rank higher than zero. Let the 


functions R and R, be invariant with respect to a group with an infinitesimal 
operator 
a 


a 
2 Un (ts Ga94s «+4 Gai Peas «+» Pn) Bq « (54) 


pat 


Then the system (52) admits the integral 


aR 
de, (s+ 25 aa) = const. (55) 
» » p » 


If the coordinates gma are cyclic, we have the integrals pm = const. 
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V.I. Arnol'd 


STABILITY AND INSTABILITY OF DYNAMIC SYSTEMS 
WITH MANY DEGREES OF FREEDOM 


§ 1. Introduction. Nearly integrable conservative systems are often 
encountered in mechanics (the many-body problem, the rapidly spinning 
top, etc.). The study of such systems often utilizes the approximate, 
asymptotic methods of perturbation theory. In the present paper we con- 
sider the correspondence between exact and approximate solutions of the 
equations of dynamics over an infinite time interval. 

It turns out that the stability of systems with one or two degrees of 
freedom may usually be ascertained on the basis of the first approximation. 
On the other hand, a system with s>3 degrees of freedom may be stable in 
all the approximations of perturbation theory, and unstable in reality. 

We start with an example of an oscillatory system with one degree of 
freedom. 

§ 2. Example. Consider the motion of a pendulum, described by the 
equation 

x+o'sinx = ef (x, ?), 


where the disturbing force f has period 2n in the angle x and the time ¢. The 
motion of the pendulum is described by a "trajectory" x (f), x (0 in the three- 
dimensional ''phase space" x, z,¢. Consider a point X = (x, z) in the plane of 
the initial conditions t=0. The trajectory issuing from this point intersects 
the plane ¢ = 2x at the point TX = (x (2n), x (2n)). By virtue of the periodicity 
in ¢, the planes ¢ = 0 and¢ = 2n canbe identified. Thus, the trajectory is 
described on the ''secant plane’ x.z by the sequence of points X,7X, ™X,..., 
where 7 is the transformation mapping X onto TX. 

First lete=0O. Then the energy E = ¥/,x* + w* (1 —cos x) is conserved, and 
all the points 7°X lie on one curve EF =const. For small energies (E < 2%), 
this curve is closed. One may define the ''frequency' Q (E) as the limit 
lim N (n)/n, where N (an) is the number of revolutions of the point T7°X around 
Aa--oo 


the point x=zx=0. It is obvious that 2 (E) =o only if E = 0(nonlinearity). 

We call the curve £ = const a resonance curve if Q (E) = m/n where m and 
n are integers. If the point X lies ona resonance energy level, then 
T"X = X, and the sequence X, TX,... consists of a finite number of different 
points. At nonresonance energy levels, 7*°X xX and the sequence X,7TX.,,... 
is dense in the curve E = const. 

Now let e#:0. In this case the energy & is not conserved and the points 
T°X do not lie on the curves £ = const. It turns out that for smalle, the 
pattern described above is deformed as follows. Most of the nonresonance 
curves E= const give rise to almost closed curves [ which are invariant 
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under the perturbed transformatiun 7. Fora Point x lying on such a curve 
Tr, the sequence 7"X is dense in [. 

In the presence of perturbation the resonance curves disintegrate. 
Instead of the curve Q (E) = avn, there are 2kn points X for which 7°X =X. 
Of these points half are stable (elliptic) and half unstable (hyperbolic). The 
resulting pattern is depicted schematically in Figure 1. Note the compli- 
cated behavior of the separatrices of the hyperbolic points. 

The existence of the invariant curves f implies, in particular, the 
stability of the pendulum's oscillations: for a sufficiently small perturbation 
¢, the change E (#) —E (0) in energy remains small for all t, -w<t<+oo. 


The curves I are found using a version of perturbation theory suggested 
by Kolmogorov in 1954. This method is similar to Newton's method of 
tangents for the solution of algebraic equations. The first approximation 
coincides with the first approximation of asymptotic methods. It is well- 
known that the system of succesive approximations of perturbation theory 
diverges (small denominators). In Newton's method the error in each 
approximation is of the order of the square of the error in the previous one. 
This rapid convergence overcomes the influence of the small denominators 
and the method yields converging approximations. 

§$ 3. Some results. Since 1954 the indicated method has been developed 
by the author, J. Moser, and others. We shall not give a detailed summary 
of all the results, * but only a few typical ones. 

A. Let 7 be an area-preserving mapping of the plane onto itself (the 
example of § 2) given in polar coordinates r,@ by the formulas 


("+ (7 426 9) } where OS oe 
P @P+A+ Ar? + O(r, 9) ID|< Cr. 
If 4 =e 2nm/3, 2nin/4 and A, 0, then the fixed point 0,0 is stable in the 


Lyapunov sense. 


¢ For references see Arnol’d, V.I. Malye znamenateli (Small Denominators).—Uspekhi matematicheskikh 
nauk, Vol. 18, No. 6 (114): 91—192. 1963. 
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The theorems of the stability of the equilibrium positions and the 
periodical solutions of Hamiltonian systems with two degrees of freedom 
are similar inform. These theorems imply, in particular, the stability 
of the Lagrangian of the periodic motion of the restricted three-body 
problem. 

B. Consider a heavy rigid body fixed at an arbitrary point 0 and rotated 
rapidly. It turns out that the magnitude and inclination to the horizontal of 
the angular-momentum vector always remain near their initial values. In 
particular, if the body is rotated rapidly about the major or minor inertia 
axis, then the axis of rotation in the body will always remain near that axis 
(in space it will precess slowly around the vertical). 

C. Another result is the perpetual adiabatic invariance of the action 
variable in the case of slow periodic variation of the parameters of a 
nonlinear system. Consider, for example, motion in a potential well: 


pees, Fp 
~~ ox’ y= Oy’ 


where 
va=(ifexy£, 


At the "bottom" of the well the particle may escape to infinity, if y=y=0. 
It turns out that for small e the well traps particles with + 4° + 0; 
sup|x|(Q<00o. This follows from the perpetual adiabatic invariance of 


Jy =P tery | 
2(1 + ets*) 


The increment |/, () — /, (0) | is small for all ¢t, ~o <t< + oo. 

§ 4. Multi-frequency motion. All previous examples refer to systems 
with two degrees of freedom. The proofs are based on finding multi- 
frequency conditionally periodic motions. Such motions also exist in 
systems with many degrees of freedom. 

Consider, in particular, the three- (or many-) body problem. If the 
masses of the ''planets'"’ are sufficiently small relative to the mass of the 
central body, the motion is Keplerian 
in the first approximation. Let us 
assume that the initial conditions are 
such that the eccentricities and inclina- 
tions of the Keplerian ellipses are small, 
and the bodies move along the ellipses 
in one direction. 

It turns out that for most initial 
conditions of the indicated type the motion 
is conditionally periodic (in particular, 
it is bounded in an infinite time interval). 

The use of conditionally-periodic 
solutions in order to ascertain the 
stability of all motions is based onsimple 
topological considerations. Consider a 
conservative system with s degrees of freedom and a 2s-dimensional phase 
space. In this space an s-frequency motion is represented by the winding 
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of a A-dimensional invariant torus M(one can visualize the case s = 2—an 
ordinary torus withangular coordinates g, and 9, and motion 9, =, 9, =, 
with frequencies o,and w,). The energy integral divides the 2s-dimensional 
phase space into 2s — l-dimensional invariant manifolds E =const. If s = 2 
the two-dimensional invariant toruses M lie in a three-dimensional energy 
level E = const, resembling coaxial tubes (Figure 2). They partition the 
manifold E = const , though not filling it completely (Figure 1). Trajectories 
in the gaps between the toruses M cannot intersect them and always remain 
between them. In systems with two degrees of freedom, stability therefore 
follows from the existence of the toruses M. 

§ 5. Instability. Now let the number of degrees of freedom be s>3, 
for example, s=3. The energy level is of dimension 2s —1=5, and the 
invariant toruses M of dimension s=3. But a three-dimensional manifold 
does not partition a five-dimensional one (just as a point does not partition 
a plane or a straight line a space). The gaps between the invariant toruses 
M therefore communicate with each other. In the region consisting of these 
gaps instability may occur. 

In fact, consider a system with angular coordinates gq, and q,, 


a=—- ze R=— gee where V = e(cosq, — I) {1 + p (sings + cos/)). 


Here the phase space 9), 9, 9, 9, f is five-dimensional, e and p are small 
parameters, and the unperturbed motion has three-frequencies (@, = ws, f = 1). 
The quantity 9, = /, does not have secular variations in all the approxima- 
tions of perturbation theory. It turns out that for arbitrarily small e,p,¢@ 
there are initial conditions for which /,(0)<1,/, > 1. These initial 
conditions are situated in the resonance zone 4,=/,~Ve. The secular 


variation of /, is not revealed by asymptotic methods, because its rate is 
of order exp (—1/Ve). 
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APPLICATION OF STABILITY THEORY TO STATIC 
AND DYNAMIC STABILITY PROBLEMS OF POWER 
SYSTEMS 


1. The problem of stability as a constituent part of the 
problem of reliability and economy of a power system 


The problem of power system reliability is currently regarded, on the 
one hand, as that of determining the necessary power and energy reserves 
and, on the other, as that of the stability of the steady-state conditions of 
the system. The solutions of these two problems are applied almost 
independently of each other, and no appropriate allowance is made for the 
daily and seasonal variability of the operating conditions of the system. 

Until recently (and in the overwhelming majority of the literature also 
currently) determination of the emergency reserve has been based on the 
probability of accidental loss of generated power by the so-called unit failure 
probability p. The inadequacy of such an approach is clear from the fact 
that p, i.e., the ratio of the total idle time of the unit to its total exploita- 
tion time, does not indicate the frequency of failures and their daily and 
seasonal distribution. For instance, the different effects of night and day 
failure, or, if the system contains a hydroelectric station, of failures in 

the high-water period and low-water period, is obvious. 

The stability problem consists in determining the stability or instability 
of the system for given values of its parameters under a given perturbation, 
or in determining values of the parameter which ensure stability, without 
determining the probabilities of stability violation. The formulas given in 
/1/ for the probability of stability violations do not solve the problem (even 
if one agrees with the underlying assumptions), since in general the problem 
of estimating this probability cannot be effectively formulated and solved by 
the theory of random events, from which standpoint it is considered in /1/, 
but needs the more advanced theory of stochastic processes. 

From the viewpoint of the reliability of electric power supply to 
consumers, there is no essential difference between stability violations 
and other failures, and these violations should be taken into account in 
estimating the total probability of continuous electric supply on a par with 
failures in stationary units, breaks in electric transmission lines and so 
on. The reliability problem is, in the final analysis, the problem of the 
economy of the power system, since a failure in power supply to consumers 
entails commercial loss or inconvenience for the public, but, with extremely 
rare exceptions, does not have disastrous results. In operation this problem 
is an organic part of the problem of the economically most advantageous 
operating conditions of the system, and, at the planning stage, of the problem 
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of the most advantageous configuration of the system, parameters of the 
stations and transmission lines. 

The restrictions, natural or otherwise, governing the control functions 
x, of the system's operating conditions (such as the loads on the stations, 
the current in the transmission lines, the voltages at the junctions of the 
network, the filling of the water reservoirs of the hydroelectric station, 
etc.) may be formulated as inequalities 


F (x1, 6-0. Xmi Yrsecer Yay G1y ++ 0) Gr) > 0. (1.1) 


Here y;,...,Y, are random functions of time expressing the external 
conditions (natural discharge into the water reservoirs of the hydroelectric 
station, the loads on the supply transformers etc.), and a,,...,a, are the 
parameters of the system. The simplest of inequalities (1.1) express 
obvious circumstances, such as the fact that the load on the stations cannot 
exceed the available power or that the water level in the reservoir lies 
between the dead storage level and the normal backwater level. 

The most complicated inequalities refer to stability. Failure of equip- 
ment, short circuits, nonselective operation of protective devices, etc., 
lead to a situation in which the parameters a, do not remain constant, but 
are random time functions. 

The combined problem of economy and reliability at the operating stage 
consists in regulating the choice of control functions x, to ensure minimum 
mathematical expectation of the annual operating expenditures subject to 
satisfaction of inequalities (1.1). That part of these expenditures which 
depends on the choice of a strategy consists of the cost of fuel for the 
thermal power plants of the system and of losses due to deficient power 
supply to consumers. 

The economy and reliability problem is considered in this formulation 
in /2—4/, though not all the inequalities (1.1) are taken into account. 


2. Inequalities relating to stability 


The literature on stability of power systems uses terms /5/ to which 
correspond the following concepts of general stability theory. 

1. Static stability—local stability (under infinitesimal perturbations of 
the steady-state conditions). 

2. Dynamic stability — global stability (under finite perturbations of 
conditions to be established upon termination of the perturbation). 

3. Resultant stability asymptotic stability. 

4. Synchronous dynamic stability — Lyapunov stability, ordinary (for 
example, in the conservative idealization case) or asymptotic, under 
perturbations which do not cause deviations of more than 2x in the angular 
coordinates (determining the positions of the rotors of the units) from their 
values in conditions established after termination of the perturbation. 

5. Stability with respect to a given station—stability ina given coordinate. 

These definitions assume the power system to be an autonomous dynamic 
system. , 

Henceforth, dynamic stability will always mean ordinary asymptotic 
stability, since the difference between synchronous stability and resultant 
stability is not essential. 
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Those inequalities of (1.1) relating to stability always express, in 
particular, the conditions for static stability of a system. In the presence 
of random perturbations, the conditions for static stability under a load 
equal to the mathematical expectation of the fluctuating load must be 
satisfied. 

If there are weak links in the system, uncontrollable power currents 
through these links become appreciable. This is due to the fact that 
automatic control in power systems responds to relatively slow power 
fluctuations, but not to rapid oscillations. The amplitude of the latter is 
commensurate with the capacity of the weak link, and these oscillations 
should be regarded as a probabilistic process with unbounded distributions. 
Here we have a case when, under constant perturbations, the ordinary 
well-developed theory of stability, which assumes them to be bounded, is 
inapplicable. 

Let us assume that there are noconstantly operating random perturbations. 
Until a short circuit occurs the vector a, whose components are the para- 
meters a, in any of inequalities (1.1), has the value a. Due to the short 
circuit, this vector assumes the value a), and after the damaged line is 
repaired —the value a), The system then reaches new steady-state 
conditions if a @L, where L is a set determined by the dynamic stability 
and depending on a, a), and the time required to repair the short circuit. 
If a) EL, trouble will develop, which may, for example, cause the system 
to split. Following repair, the left-hand side of (1.1) will be different in 
the cases a GL anda” ZL. Introducing a random variable B defined by 
B= 1 in the first case and B = 0 in the second case, we may write (1.1) as 
a single inequality for both cases. | 

The time elapsing from the occurrence of the short circuit to practical 
stabilization of the system after its disconnection (the probability of a second 
short circuit, or of any other failure, during this time may be assumed 
zero) is insignificant from the economic viewpoint and the transition from 
a and a) may be consideredinstantaneous. Some time after this transition 
the vector a assumes a new value, due to another short circuit or to 
liquidation of the aftereffects of the stability violation caused by the first 
short circuit, and soon. Thus, the reliability problem, as far as stability 
is concerned, consists, first, in establishing inequalities (1.1) on the basis 
of static and dynamic stability calculations, and second, in determining the 
probability characteristics (distribution functions) of the discontinuous 
stochastic process of changes in the time vector function a. In this case 
it is necessary, of course, to take into account that the changes are due not 
only to short circuits but also to other causes, such as disconnection of 
certain units (for example, lines) for repairs or gradual changes in the 
consumer load. Inthe presence of constantly operating random perturbations, 
the vector a will be influenced by stability violations caused by these 
perturbations; this must be reflected in the appropriate calculations. 


3. Stability problems for simple systems 


There are two principal approaches to the study of the stability of power 
systems: 1) rigorous (exact or approximate) solution of stability problems 
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for simple power systems (e.g., a generator working through a line into 
receiving D.C. busbars, systems consisting of two or three generators of 
commensurate power, etc.); 2) intuitive engineering solutions for multi- 
machine systems, which consist in reducing them to simpler ones. The 
intuitive character of the second approach is due to the great complexity 
(mainly the multidimensionality) of the corresponding problems. 

The first approach is valuable in that it reveals the essential features 
of the transient processes, which are characteristic in many cases not only 
for simple systems but also for general systems. Apparently, their value 
is also preserved when rigorous approaches to multimachine systems can 
be found. 

The first approach is based on the actual possibility, under certain 
conditions, of decomposing the total transient process in a power system 
into components of different physical nature. This in turn makes it possible, 
when these conditions are satisfied, to consider independently the stability 
of hydraulic processes in the water intake of a hydraulic power plant (in the 
pressure of so-called ideal regulators), electromechanical processes (when 
the power of the primary engines is constant), the control stability of the 
turbines. In this connection, the past decade has seen the publication of a 
few results obtained by global consideration of the stability of hydraulic 
and electromechanical processes in simple dynamic power systems. A 
review of the corresponding papers is given in /6/.* Here we consider the 
most important results to appear since /6/. 

In 1961 a paper appeared /7/ which considers a conservative idealization 
of the problem of global stability of the parallel operation of three synchro- 
nous generators (the research was carried out mainly in 1945, but was not 
published at the time). The paper is based on considerations of A.A. Gorev 
concerning the influence of finite perturbations on the stability of a given 
system /8/ and the solution uses methods of bifurcation theory. The content 
of the paper is as follows (in view of the extreme brevity of /7/ it is given 
almost literally). 

According to A.A.Gorev, the equations of motion of three parallel- 
working machines whose configuration is given by the angles 6,, 6,, 6, can 
be written in the form of the Lagrange-Maxwell equations 


: (%)— 3%, = (i= 1, 2, 3), (3.1) 


aI 


where 
T -5 (4,62 + 2,6 + 63), 
U = p,8; + pis + PBs + 91s 00S (8; — Os) + Giz COS (8: — 85) + 7 (3.2) 
+ Js cos (8, — 8,). 


Here p, and g, are constants depending on the operating conditions of the 
system, and 


Pit Pst pPe= 0. (3.3) 


(This condition states that the total power fed into the system through the 
shafts of the machines is equal to the total power consumed in the network. ) 


* See also /19, 20/, 
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Using the first integral of system (1) 


k8: ++ kOs + ks = const (3.4) 
and the substitutions 
9=0,—08, p=%—9s; (3.5) 


the problem is reduced to a conservative system with two degrees of 
freedom, for which the kinetic and potential energies are respectively 


2 . : : : 
T= pe leaks (@ — B+ ha (ead? + ba*)), 
— U = pre + Pa? + 912. COs (P — ) + 919 COS P + x3 COS H. (3.6) 
The coordinates g* and »* of the equilibrium state of this new system, 


which correspond to rotation of the machines with constant mismatch angles, 
can be found from the conditions 


ou 

wv _9g, 

ns ~ (3.7) 
au 

xy = 0 


The solution 9 =9*,»=** of system (3.7) corresponds to a locally 
stable equilibrium state if the function U has a maximum at the point 9°, »*. 

Suppose the system is in a stable stationary state and then the parameters 
Pi Pe Gis) M13 ANd gy aSSuMe new values instantaneously (or during some finite 
time interval). Then the mismatch angles 9=q (t) and p= ¥y (t) either remain 
bounded (stable transition), or at least one of them will increase without 
limit (unstable transition). 

This problem is solved by a method indicated in /8/, similar to that of 
Hill in his well-known investigations of lunar motion /9/ (in an idealization 
corresponding to the restricted three-body problem). 

If 4} = T —Uis the energy of the system immediately after the transition, 
then, since the system is conservative, the subsequent motion of the system 
also satisfies 


—U<h. (3.8) 


Inequality (3.8) defines a certain domain of values for the variables @ and wy. 
For a transition to be stable it is sufficient that the connected component 

of the domain containing the point 9*,»* be bounded immediately after the 
transition. The boundary of this region is the curve 


U (@, >) = Ai. (3.9) 


The problem is to determine the dependence of these curves on the 
parameters p,; and q. 
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Since @ and » are angular variables, the set of values of » and pis a 
two-dimensional torus. The problem thus reduces to studying the partition 
of the torus surface by the family of curves 


H (9, ¥) = aq + By + cos (p—¥) + 1c0s + dcos p = h, (3.10) 


where a= p,/q,, and so on (see (3.6)). 
These curves are the phase trajectories of the system of equations 


- een (3.11) 
=—% =—atsin(g—y¥) +1sing. 


For a conservative system a qualitative picture of the partition of the 
torus surface into trajectories can be found using of the energy integral, 
as in the case of the phase plane. In accordance with (3.10), this integral 
can be represented as 


H(@, ~) = ap + BY + G(q, ) =const, (3.12) 


where a and 8 are constants, and G (9, ¥) is a periodic function of p and » 
with period 2n. 

The partition of the torus into trajectories forms a number of cells of 
the same type as in the plane, whose boundaries are the separatrices of 
the saddle points, and one cell of a new type Dap. The cell Da,» consists 
of closed trajectories of the type (m,n), if a and 6 are commensurable and 
a/p = m/n (mand na being integers), and of nonclosed Poisson-stable 
trajectories, if a and f are not commensurable (periodic motion of type 
(m,n) means that in moving along the corresponding closed trajectory, the 
variable @ changes by 2xm, and the variable » by 2nxn). The cell Da,is a 
torus with a number of "holes". 

To study the behavior of a conservative dynamic system on a torus as 
dependent on the parameters, we may restrict ourselves to bifurcations 
which modify the structure of the torus partition into elementary cells. 
Such bifurcations occur in the following two cases: 

1. The system 


=o, F=0 (3.13) 


has roots g*,»* for which 


+= (ESE —(RE Nao 


2. The system of equations (3.13) has two solutions: 9, }, and 9g, for 
which H (9;, ¥,) = H (,. 9) 

It is pointed out in /7/ that this makes it possible to find out all possible 
modes of partition of the torus into trajectories for arbitrary values of 
a, B, y, 6 in equations (3.11), and to find domains in the parameter space 
corresponding to each of these modes. In turn, this implies a number of 
general statements as to which finite perturbations lead, under conditions 
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of conservative idealization, to unlimited increase in the mismatches of 
the machine operation, and which keep these mismatches finite. 

Recently some other papers have appeared (particularly, /10—12/) in 
which global stability of a conservative system of three parallel-operating 
generators is determined, as by Gorev, using the first integral of the 
equations of motion. However, these papers essentially repeat some 
results of /8/. The paper /12/, which sets forth a graphical method 
simplifying the corresponding calculations, is of some interest. 

The paper /13/ studies global stability of the nonlinear nonconservative 
problem of energetics. It considers a salient pole generator working 
(through a step-up transformer and transmission line) into the busbars of 
a receiving system of infinite power. It is assumed that the excitation of 
the generator is controlled by a strong-action regulator. The phase space 
of the problem is three-dimensional. Using the smallness of certain 
parameters (corresponding, under the conditions of the problem, to a small 
degree of nonconservativity), the author establishes an approximate relation 
between the phase variables, after which he passes from the phase space 
to the phase plane. 

The continuation of /13/ deals with the construction of the attraction 
region of a stable equilibrium state in the plane. Leaving the details aside, 
we note that, the results of /13/ notwithstanding, the problem of complete 
qualitative study of the behavior of nonconservative power systems witha 
three-dimensional phase space is still essentially open (especially if the 
system is not almost conservative). 

The only, but unfortunately incorrect, attempt toformulate and solve the 
stability problem in the case of continuously acting random perturbations 
is made in /14, 15/. 

The problem may be formulated as follows. Let z (i= 1,....n) be the 
generalized coordinates of the system under consideration, z = f,( the 
solution of the equations of the system in the absence of perturbations, and 
z= z(t) the solution in the presence of random perturbations whose mathe- 
matical expectation is assumed zero. If the condition [z, (0) —f, (0)}*+ ...+ 
+ {z,(0) —/, (0)]* << e® is satisfied, then for t<f the condition [2 ()—/, (t))}?+ 
+.--+ [za (Q—fa(O}? & e* is satisfied. Here ¢* is a random variable. It is 
required to determine the distribution function of f* for given e and random 
initial conditions, when the quantities z, (0),...,2, (0) are given by their joint 
distribution. In practical calculations, e is determined by technical 
considerations. 


4. Reduction of complex to simple systems 


The advent of computers has made it possible to solve the equations of 
transients (numerically, on digital computers, by simulation, on analog 
computers) for given initial conditions, i.e., to make a direct check of the 
stability under a given perturbation for fairly complex systems. Nevertheless, 
the second method indicated at the beginning of section (3) has not been 
superseded, since concrete systems with hundreds and thousands of 
generators are replaced not by simple systems but by computation schemes 
with tens of generating units. This is described, for example, in /16/ for 
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the Swedish power system. In such cases the same computer programs 
are usually used for both dynamic (global) and static (local) stability 
calculations, where they obviate tedious operations with characteristic 
determinants. 

Up to the present a fair number of investigations on the " equivalentiza- 
tion’ of power system units, i.e., replacing several units by a single 
"equivalent'’ machine, have been described. Such a replacement is in order 
when the machines being replaced and their controllers are identical, 
equally loaded and occupy the same position in the system, and the 
perturbations in question do not cause mutual oscillation of the machines. 
But even if these conditions are satisfied, one must be sure that the system 
is stable under infinitesimal perturbations of the individual machines being 
replaced by a single equivalent one. That stability of this kind may be 
neglected has become apparent in practice, for example, in the adjustment 
of the strong-action excitation regulators (i.e., operating according to some 
functional of the regulated quantity), of the Volga Hydroelectric Station im. 
Lenin. 

In all other cases "equivalentization' of units, i.e., simplification of the 
network of the power system under consideration, is a somewhat arbitrary 
method. It is more natural, correct, and effective not to attempt to simplify 
the network of the power system, but to simplify the equation system of the 
transients; this does not necessarily imply its reduction to a system of 
equations describing analogous cases in a simpler power system. Sucha 
reduction is meaningful only for simulation, in the precise sense of the 
word, of power systems by physical models. In practice, however, physical 
models of power systems usually play the role of stands for testing equip- 
ment, which require not exact similarity but only plausible reproduction 
of natural conditions. 

During the past decade papers have been published in which linearized 
systems of equations of transients in parallel operating machines are 
simplified, i.e., static stability is considered. These papers, which make 
use of the specific properties of the linear equations of concrete problems, 
will not be considered here. 

One general method for simplifying the initial equation system of a 
complex power system, not connected with the specific features of linear 
systems, consists in separating the motions of the power system into rapid 
and slow. The mathematical aspect of this method is the theory of differen- 
tial equations with small coefficients; the literature of the subject is 
extensive /6, 17/. The method is widely applied in oscillation theory, and 
there is no need to dwell on its characterization. Its applications to certain 
power problems are considered in /6/. 

Recently this method has yielded a practically important result in the 
problem of interaction of electrical, mechanical, and hydraulic processes 
in power systems and electric stations /18/; it has been shown that within 
a wide range of operating conditions for a hydraulic power plant in a power 
system, one can achieve convergence to equilibrium under infinitesimal 
and finite perturbations, of transients in the water intake installations, for 
any parameters of these installations, by suitable adjustment of the velocity 
controllers of the turbines. In other words, hydraulic stability does not 
entail expensive construction work, but only adjustment measures which 
do not require additional expenditure. 
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In this problem the slowest motions —hydraulic oscillations —are 
considered. In investigating the faster electromechanical oscillations, 
separation of motions according to their velocities is not always effective. 
This is clear from the example of a symmetrical power system in the form 
of a regular polygon with identical generators at the vertices, and loads at 
the centers of the sides. In this case the differences between the corre- 
sponding generalized coordinates of neighboring generators may be neglected 
(if the perturbation does not lead to sharp deviations in these coordinates 
at the beginning of the transient process); the small distance between the 
generators may vary no faster than the coordinates themselves. 

Suppose the system of initial equations is reduced to normal Cauchy 
form 


dz 1 
St Fe (ers = 4 2a): (4.1) 
Hence, 
Spa op) a ete 2n) —a— (2 Zn) (4.2) 
dt f ) = FO le -e+ep 2A T, ?s Ae woes Sa} . 


If the absolute value of the right-hand side of (4.2) is smaller than a 
sufficiently small positive number and is integrable with respect to ¢, then 
z-2,+const. This relation holds for the angular coordinates of the rotors, 
and (for const = 0) for the voltages on the terminals of the generators of 
neighboring (i.e., with small mutual impedance) electric stations. Trans- 
formation of the initial equations on this basis is considered in /18/. Here, 
of course, it is also necessary to check local stability with respect to 
mutual oscillations of the machines for which the indicated relations are 
used—if there is no such stability, the right-hand side of (4.2) cannot be 
absolutely integrable. It can be easily shown that, mathematically, 
neglecting the differences between similar motions is also justified by the 
properties of systems of differential equations with sharply differing 
coefficients. 


5. Integro-differential equations 


A complex power system with sufficiently many units may be considered 
approximately as a system with distributed parameters /18/. To this end 
the numbers A;=N;(N,;+..-+Na)", x1 =A, +...+ 4, are associated with the 
i-th unit (installed capacity N,), and an independent variable x assuming the 
continuum of values 0< x <1, including all the values x,, is introduced. 
The corresponding generalized coordinates of the different units are 
considered not as a set of a functions of ¢, but as a single function of ¢ and 
». The equations of balance of the moments on the shafts of the units, the 
emf equations then take the form /18/: 


Ta (2) PS =S(t, x)—E (t, 2) (FE sin t0(¢, x) —O(¢, y)—a (x ydy, (5.1) 
° 
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Ea (t, 2) = E(t, 2) +) FE cos 10(¢, )—9(6, y—a(x, y)) dy, (5.2) 
6 


U(t, x) = {E(t x) —( EY cos (ot, x) —O(t, y) —a (x, y)i dy" 4 


’ X (x. y) 
1 
E(t, y) : ea £4 3 

+ Vaan ay) sin (6(t, x) —O(t, y) —a(x, y)} dy} , (5.3) 
Zp, © 7 4 
ee ee ee 

(x. ¥) Zin’ Y (x, y) aes a X (x,y) "2 ; 

ta i-1 5p) 1 


where 7, is the inertia constant of the rotor, related to the power of the 
unit, S is the mechanical moment, related to the nominal mechanical 
moment of the unit, and the remaining notations are the conventional ones. 
The remaining equations (of the prime movers, excitation systems of the 
generators, regulation of the units) are similarly transformed. 

No restrictions other than (5.4) are imposed on the functions X,Y, and 
Z, i.e., these may be step functions. In this case the above integro- 
differential equations are only a new representation of the previous problem, 
permitting the application of new methods, which may turn out to be more 
effective than the old ones, to the given problem. If, however, X,Y, Z, 
and the other functions characterizing the parameters of the system (such 
as T.) are well approximated by fairly smooth functions (which is apparently 
possible in most cases), then the unwieldy system of ordinary equations is 
approximated by a fairly compact integro-differential system. 

In problems of static stability, i.e., stability under infinitesimal 
perturbations, this system becomes linear, and substitutions of the type 
AO = g (xe! reduce it to a system of homogeneous integral equations. 
Stability under infinitesimal perturbations is ensured if the real parts of 
all the eigenvalues p of the latter system are negative. Investigation of 
global stability, under a given perturbation, is possible by numerical 
solution of the system of integro-differential equations, which is a 
comparatively simple matter. A more profound analysis of global stability 
requires a generalization of Lyapunov's theory to continuous systems. 


Bibliography 


1. Venikov, V.A., Yu.M. Gorskii, and L.A. Soldatkina. Metody 
teorii veroyatnostei v primenenii k analizu rezhimov elektroener- 
geticheskikh sistem (Applications of Methods of Probability 
Theory to the Analysis of Operating Conditions of Electric 
Power Systems).—In: Sbornik ''Primenenie veroyatnostnykh 
i statisticheskikh metodov k analizu rezhimov energosistem,"' 
Kiev, Gostekhizdat USSR. 1963. 

2. Tsvetkov, E.V. Veroyatnostnaya metodika naznacheniya optimal'nykh 
rezhimov energosistem s gidrostantsiyami dlitel'nogo reguliro- 
vaniya (Probability Methods of Assigning Optimum Operating 
Conditions to Power Systems Having Hydraulic Power Plants with 


24 


10. 


12. 


13. 


ARONOVICH and KARTVELISHVILI 


Long-Term Regulations). —Trudy Vsesoyuznogo Nauchno-Issledo- 
vatel'skogo Instituta, Elektroenergetiki, No. 13, Moskva, 
Gosenergoizdat. 1961. 

Kartvelishvili, N.A. Uravneniya optimal'nogo rezhima energe- 
ticheskoi sistemy kak veroyatnostnogo protsessa s nepreryvnym 
vremenem (Equations of the Optimum Operating Conditions of a 
Power System as a Probabilistic Continuous-Time Process). — 
Prikladnaya Mekhanika i Tekhnicheskaya Fizika, No. 1. 1964. 

Tsvetkov, E.V. Ispol'zovanie veroyatnostnoi metodiki naznacheniya 
optimal'nykh rezhimov energosistem s gidrostantsiyami pri 
proektirovanii (Probability Methods of Assigning Optimum 
Operating Conditions to Power Systems with Hydraulic Power 
Plants, as Applied in Design). —Sbornik ''Primenenie veroyatnost- 
nykh i statisticheskikh metodov k analizu rezhimov energosistem", 
Kiev, Gostekhizdat. USSR. 1963. 

Azar'ev, D.I., V.A. Venikov, I.V. Litkens, L.G. Mamikoni- 
yants, M.G. Portnoi, andS.A. Sovalov. Osnovnye 
polozheniya po opredeleniyu ustoichivosti energeticheskikh sistem 
(Basic Assumptions in Determining Stability of Power Systems). — 
Elektrichestvo, No. 11. 1963. 

Aronovich, G.V., L.N. Belyustina, N.A. Kartvelishvili, 
and Ya.K. Lyubimtsev. Zadachi ustoichivosti statsionarnykh 
rezhimov gidroelektricheskikh stantsii i energeticheskikh sistem 
kak zadachi teorii kolebanii (Stability Problems for Steady-State 
Conditions in Hydraulic Power Plants and in Power Systems, as 
Problems of Oscillation Theory).—Prikladnaya Mekhanika i 
Tekhnicheskaya Fizika, No. 3. 1961. 

Andronov, A.A. and Yu.I. Neimark. Teoriya ustoichivosti v 
bol'shom parallel'noi raboty sinkhromnykh mashin i problema 
Khilla (Theory of Dynamic Stability of Parallel Operation of 
Synchronous Machines, and Hill's Problem).— Trudy Tashkents- 
kogo Politekhnicheskogo Instituta, No. 20, pp. 132—137. 1961. 

Gorev, A.A. Vvedenie v teoriyu ustoichivosti parallel'noi raboty 
elektricheskikh stantsii ch. 1. Leningrad, Izd. Komissii po 
uluchsheniyu Byta Uchashchikhsya. 1936. Tozhe. (Introduction 
to the Theory of Stability of Parallel Operations of Power 
Stations). —Izbrannye trudy po voprosam ustoichivosti elektri- 
cheskikh sistem, Moskva, Gosenergoizdat. 1960. 

Hill, G.W. Researches in the Lunar Theory.—Amer. J. Math., Vol. 1, 
Nos. 5—26, pp. 129—147, 246—260. 1878. 

Aylett, R.D. The Energy-Integral Criteria of Transient Stability 
Limits of Power Systems.—IEE Monographs, No. 308, pp. 527— 
533, July, 1958. 

Szendy, Ch. and B. Bokay. Transient and Steady-State Stability 
Conditions of a Multimachine System.—Conference Internat. des 
Grands Réseaux Electriques 4 Haute Tension. Sess (15—25 Juin), 
No. 308, pp. 1—16. Paris. 1960. 

Szendy, K. Kriterium der dynamischen Stabilitat in Dreigeneratoren- 
system und dessen Verallgemeinerungen auf Mehrgeneratoren- 
system.—Arch. Elektrotechn, Vol. 47, No. 3, pp. 149—160. 1962. 

Rozovskii, Yu.A. Primenenie kachestvennoi teorri differentsial'nykh 
uravnenii k analizu ustoichivosti elektricheskikh sistem (Applica- 


25 


14, 


15. 


16. 


17. 


18, 


19, 


20. 


$037 


ARONOVICH and KARTVELISHVILI 


tion of the Qualitative Theory of Differential Equations to the 
Stability Analysis of Power System). — Izvestiya Nauchno-Issle- 
dovatel'skogo Instituta postoyannogo teka, No. 6, pp. 238—257. 1960. 

Andreyuk, V.A. andE.A. Marchenko. Prilozhenie teorii 
teorii sluchainykh funktsii k raschetu stationarnogo rezhima i 
ustoichivosti ob''edinennykh energosistem so slabymi mezhsistem- 
nymi svyazyami (Application of the Theory of Random Functions 
to the Calculation of Steady-State Conditions and Stability of 
Interconnected Power Systems with Weak Intersystem Links). — 
In: Sbornik ' Primenenie veroyatnostnykh metodov k analizu 
rezhimov energosistem.'' Kiev, Gostekhizdat UkrSSR. 1963. 

Andreyuk, V.A. Metodika rascheta nadezhnosti parallel'noi raboty 
energosistemy v usloviyakh slabykh mezhsistemnykh svyazei 
(Methods of Calculating the Reliability of Parallel Operation of 
a Power System in Conditions of Weak Intersystem Links).—In: 
Sbornik ''Primenenie veroyatnostnykh i statisticheskikh metodov 
k analizu rezhimov energosistem", Kiev, Gostekhizdat UkrSSR. 1963. 

Nordstrom, B., L. Norlin, and R. Gradin. Etudes de la 
Stabilité sur le Réseaux Suédois de Transport d' Energie a 400 kv. — 
Conf. internat. grands rés. électr. haute tens. Rapp. No. 324, 
pp. 1—52, Paris. 1960. 

Millionshchikov, V.M. Teoriya differentsial'nykh uravnenii s 
malym mnozhitelem pri proizvodnykh v lineinykh topolagicheskikh 
prostramSstvakh (Theory of Differential Equations with Small 
Coefficients in Linear Topological Spaces).—Doklady Akademii 
Nauk SSSR, Vol. 146, No. 5. 1962. 

Kartvelishvili, N.A. O perekhodnykh rezhimak i ustoichivosti 
avtonomnykh dinamicheskikh sistem s bol'shim konechnym 
chislom stepenei svobody (Transients and Stability of Autonomous 
Dynamic Systems with a Large Finite Number of Degrees of 
Freedom).—Izvestiya Akademii Nauk SSSR, Mekhanika i mashino- 
stroenie, No. 4. 1963. 

Sideriades, L. Méthodes de topologie qualitative; applications a 
l'étude des chambres d'equilibre. — Houille blanche, Vol. 17, No. 4, 
pp. 569—580. 1962. 

Aronovich, G.V. Ob ustoichivosti odnoi nelineinoi dinamicheskoi 
sistemy (On the Stability of a Certain Nonlinear Dynamic 
System).—Izvestiya vuzov, No. 4, Radiofizika. 1963. 


26 


F. R. Gantmakher and V.A. Yakubovich 


ABSOLUTE STABILITY OF NONLINEAR 
CONTROL SYSTEMS 


Introduction 


We shall consider control systems containing one or several nonlinear 
units subjected, in general, to external forces. Such control systems are 
described by differential equations of the type 


= = p> ane + 2 BjaPa (Sn) + ¥, (2), 


i 0.1 
On = >) Tanta on) 


A=i 


he RS Licey §; R= 1,i000%), 


where am, Ba and y, are real constants, and without loss of generality 
ga (0) = 0. 

Most of the paper will be devoted to a study of the absolute stability of 
the state = %=...= & =0 in the absence of external action, i.e., when 
~, () =0. To simplify the exposition, and since moreover the principal 
existing results refer to this case, we shall consider mainly the case when, 
in addition, the control system contains only one nonlinear unit «= 1). In 
this case equations (0.1) become 


tm ants + Bele o= 5 tite (0.2) 
b= b=o 
(j =1, 2,...-+). 


An extensive literature is concerned with systems (0.1); most of the 
results refer to the case v=2. (The equations of Van der Pol, Liénard, 
and Duffing, for example, have the form of system (0.1) for the special 
case v=2.) It is now clear that many of these results, originally obtained 
by various special methods (usually connected with qualitative investigations 
of phase curves in the plane), can also be obtained by a single method based 
on Lyapunov functions, and this method (this is particularly important) is 
applicable to systems (0.1) of arbitrarily high order v. 

The center of interest is thus shifted to effective methods for constructing 
Lyapunov functions or, rather, finding effectively verifiable conditions for 
their existence which guarantee their most important properties (absolute 
stability for »() =0, existence of limit motions for wy, () #0 which are 
stable in the large, etc. ). 
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The present report is a survey of papers based on this approach. We 
shall not deal with papers studying particular cases of (0.1) by specific 
methods, or with approximate and asymptotic methods of importance for 
applications. 

We proceed to describe briefly the history of the problem. We shall 
distinguish between the principal case, when the characteristic 
equation of the autonomous system* 


det fajn— Snr] = 0 (0.3) 


has all its roots in the left half-plane, the simple critical case, 

when equation (0.3) has a single zero root and the other roots are in the left 
half-plane, and the general critical cage, when all the roots of (0.3) 
are in the closed left half-plane, including an arbitrary number on the 
imaginary axis. The function 9 (0) in (0.2) will at first be assumed to be 
continuous and to satisfy the condition 


0 < 09 (0) < poo" (Wo < +00) (0.4) 


in the principal case and the condition 
0 < 6p (6) < pe" (bo < +00) (0.5) 


in the critical case. Condition (0.4) implies, obviously, that the graph of 
the function @ (0) is situated in the strip bounded by the straight lines @ = 0 
and @=p,0. 

This approach was initiated by A.I. Lur'e /1/. He considers the 
problem of stability in the large of the trivial solution of the system (0.2) 
with p, = oo in the principal and simple critical cases. A Lyapunov function 
is sought in the form 


v @ 


V = >; entita + © 9 (0) do, (0.6) 
f, hak e 


where ® = 0 for the principal case and @= 1 for the simple critical case, 
and eg are specially selected coefficients depending on v parameters. For 
these v parameters a system of quadratic equations (Lur'e's resolvents) 
is obtained, the existence of a real solution of which guarantees stability 
in the large. For v <4, conditions for the existence of solutions of these 
equations have been found /1/ in explicit form, thus effectively solving a 
large class of problems of practical importance. 

Lur'e's method for systems (0.2) has been developed in different 
directions by A.M. Letov, P.V. Bromberg, V.M. Popov, Yu.I. Alimov, 
O.I. Komarnitskaya, A.K. Bedel'baev and others (see /2, 3/) and in its 
most general and final form by E.N. Rozenvasser /4/. Similar methods 
for constructing Lyapunov functions in the form (0.6) (or in an analogous 
form for the case of many nonlinearities) are considered by I.G. Malkin, 
N.N. Krasovskii, V.V. Rumyantsev, B.S. Razumikhin, E.A. Barbashin, 
V.A. Pliss, Yu.A. Alimov and others—see /2, 3/. In the USA a large 


* In equation (0.3) Sn = 0 for { geh, $= 1, 
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group of scientists (G. Szeg6, La Salle, R. Kalman, G. Bertram, G. Pearson, 
K. Meyer, and others—see /2, 3/) headed by S. Lefschetz is at present 
working in this direction. 

Since Lur'e's method is based on the use of a Lyapunov function (0.6) of 
very special form (the Lyapunov function /4/ depends on (v + 1) parameters, 
whereas the function (0.6) depends on [v (v + 1)/2+1] parameters), it has long 
seemed that the conditions for absolute stability established by Lur'e's 
method can be considerably extended by the use of all Lyapunov functions 
of type (0.6). 

In /5/ it was conjectured and proved for v < 3 that in the simple critical 
case the conditions obtained by Lur'e's method in fact include all conditions 
obtainable using Lyapunov functions of type (0.6). R. Kalman /6/ has proved 
an important algebraic proposition (/6/, ‘principal lemma’), which proves 
the conjecture for arbitrary v and p, = oo for practically all systems (0.2) 
in the simple critical case (with the exception of certain systems corre- 
sponding to a set of measure zero in the space of the parameters ayn, B;, y, ). 
For the simple critical case and p, # oo, a similar assertion is proved in 
/3/, using Kalman's result. * Quite recently a similar assertion for the 
principal and general critical cases has been proved /9/. All these results 
imply that in the problem of absolute stability of system (0.2) the class L of 
all functions of type (0.6) contains a certain optimal function V, with the 
property that if the class L contains at least one Lyapunov function for (0.2), 
V. is also a Lyapunov function. This optimal function V, is the very function 
constructed by a suitable application of Lur'e's method** (for the principal 
case, Rozenvasser's version of Lur'e's method /4/). 

Another very useful approach to the problem of absolute stability of 
systems (0.2) has been developed by V.M. Popov /10—13/ (see also /3/). 
Without using Lyapunov functions, Popov derives (in the principal and 
simple critical cases) a certain frequency condition for absolute stability 
and proves that in the simple critical case this condition includes all the 
conditions obtainable using Lyapunov functions of class L. Popov proved 
a somewhat weaker assertion for the principal case as well (we mention 
that all absolute stability criteria obtained prior to Popov's results, with 
rare exceptions, use Lyapunov functions of class L, and are therefore 
included in Popov's criterion). An advantage of Popov's criterion, apart 
from its generality, is its extreme simplicity in application and its 
formulation in terms of the frequency response of the linear part of a 
system, a term familiar to control theoreticians. 

Popov's method was later developed and applied by Ya. Z. Tsypkin, 

B.N. Naumov, A.Kh. Gelig, A. Halanay, C. Corduneanu and others (see 
/14—20/) to the proof of many new results. 


* To be specific, it was shown in /5, 7/ that in the simple critical case the existence problem for a 

Lyapunov function which is a quadratic form in the principal coordinates plus a nonlinearity integral 
(a somewhat narrower class than that of the functions (0.6)) leads to an algebraic problem — the solution 
of a special matrix inequality (see below, §3, inequality (3.2)). The above-mentioned conjecture refers 
to this inequality (/5/, page 127). Since, as shown in /3/, the same inequality figures in more general 
problems of the construction of Lyapunov functions of type (0.6) (simple special case) or of the class of 
functions (0.6) given by an S-procedure /3/ (principal case), the conjecture /5S/ implies that in all these 
cases Lur’e’s method provides exhaustive conditions. The matrix inequality is solved in /8/ and the 
conjecture /5/ concerning it has been verified by Kalman /6/. 

°° Note that here by Lur’e’s method we mean the method in which the “supplementary parameters” /5, 7/ 
are assumed equal to zero, or “limit” resolvents are considered /3/. 
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In /8/ the matrix inequalities /5/ arising in the search for necessary 
and sufficient (simple critical case) or sufficient, but in a certain sense 
optimal (principal case) conditions for the existence of Lyapunov functions 
of class L and a number of other problems, were solved. This result 
provides a new proof of Popov's criterion which implies that when Popov's 
condition is satisfied the system (0.2) has a Lyapunov function of class L. 
Thus, in the simple critical case Popov's condition is a necessary and 
sufficient condition for the existence of a Lyapunov function of class L. 

Until recently it was not clear whether Popov's condition can be 
strengthened in the principal case by using a Lyapunov function of class L. 
Recently it has been proved /9/ that this is impossible: Popov's condition 
is also a necessary and sufficient condition in the principal case for the 
existence of a Lyapunov function of class L. * 

Since, as mentioned above, Lur'e's method (for the principal case, in 
Rozenvasser's version) also includes all Lyapunov functions of class L, it 
also follows from /9/ that Popov's criterion is equivalent to the conditions 
obtained by Lur'e's method. This was proved before /9/, for the simple 
critical case and p, = oo, by Kalman /6/, and for the simple critical case 
and p», # oo and for the principal case in the book /3/ (appendix). In this 
case substantial use is made of Kalman's result /6/. 

Thus, by the efforts of a number of mathematicians the ''Lur'e problem" — 
the problem of searching for effective necessary and sufficient existence 
conditions for Lyapunov functions of class L for the system (0.2)—has been 
solved. Much research has been devoted to this problem, starting with 
/21/, which first suggested using Lyapunov functions of class L. The 
conditions provided by Lur'e's method (for the principal case, in Rozen- 
vasser's version /4/) and Popov's frequency condition turned out to be the 
required ones. 

The results described above refer to the principal and simple critical 
cases. M.A. Aizerman and F.F. Gantmakher /22/ consider the general 
critical case under the assumption that the graph of the function 9 (0) is 
separated from the critical line e=0, i.e., more accurately, that instead 
of (0.5) we have 


e0* < 09 (6) < p0* (0.7) 


for arbitrarily small e>0. It is shown that Popov's criterion carries over 
to this case, the parameter in his condition being determined as a rule from 
the condition that the frequency response passes near the critical values 
© =, (the numbers + fw, are roots of equation (0.3)). Assuming (0.5), 
Popov's criterion is extended to the general critical case in /23—25/. In 
/25/ it is also shown that, as in the previously considered cases, Popov's 
criterion coincides with the conditions obtainable by Lur'e's method for 
that case, and is also a necessary and sufficient condition for the existence 
of a Lyapunov function of class L. 

The results described above correspond, apart from a few details, to 
the contents of the book /3/ and §§ 2—5 of the present paper. ** Recently 


* More accurately, at present some similar but nonequivalent formulations of Popov's frequency condition 
are known, which refer to similar, but different cases. The statement formulated above holds in all these 
Cases; sometimes the Lyapunov function, while guaranteeing absolute stability, only satisfies the weak 
inequality V<0. 

°° The question asked in Chapter 5, § 4 of /3/ is also answered above. 
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a number of new results have been obtained, concerned with the study of 
systems (0.2) with hysteresis nonlinearities /26—28/, discontinuous 
nonlinearities /15, 25, 29/, and nonunique equilibrium position /18, 30/, 

and also systems (0.1) /16, 31/. Popov's condition has been strengthened 
/32,33/ by using additional information about the nonlinearity (and expanding 
the class L of Lyapunov functions). 

Popov, Halanay /34/ and Gelig /18/ have extended Popov's criterion to 
the infinite-dimensional case (systems containing delay links, or described 
by partial differential equations). Tsypkin /14/, followed by Szego, Kalman, 
Pearson /35—37/ and others have obtained frequency conditions for absolute 
stability of sampled-data control systems. All these results are described 
in §§ 6—12, 

In conclusion we make the following remark. At the present time the 
efforts of a number of authors have yielded many theorems of the Lyapunov 
type, which reduce some specific problem in the theory of differential 
equations to the existence problem for a function possessing some special 
properties, the most important of which is as a rule that its derivative 
should have a fixed sign everywhere or in some region determined by the 
given system of differential equations (the theorems of Yoshizawa /38—39/, 
Barbashin and Krasovskii /40/, Nemytskii /41—42/, Krasnosel'skii /43—44/, 
Zubov /45/, Lefschetz and La Salle /46/, Gelig /30/ and others, concerning 
the stability of the stationary state, stationary sets and manifolds, stability 
being understood in various senses, in the small and in the large: the 
existence of periodic and almost periodic motions, their stability and so on). 
In general it is almost impossible to indicate get methods for the construc- 
tion of such functions. If, however, special systems are being considered, 
such as systems (0.1), and one is looking for existence conditions for the 
required function in some previously delineated class of functions (class L 
or the class of quadratic forms), then the problem usually reduces toa 
purely algebraic one of solving some special matrix inequality. In this 
connection different problems often lead to the same algebraic problems. 
Taus solution of these problems yields the solution of the initial problem 
in the form of effective, usually broad, sufficient conditions (they often turn 
out to be also necessary), since they comprise all the conditions obtainable 
by using Lyapunov functions of the class under consideration. 

Some of these algebraic problems are considered below. Most of the 
results of the present paper can be obtained by the indicated method 
from the solution of these problems. However, many of the algebraic 
problems arising here remain unsolved. 

We now proceed to a detailed exposition. 


§ 1. Vector-matrix notations. Transfer functions and 
matrices. Definition of absolute stability. 


In the sequel it will be convenient to use vector-matrix notation. This 
is due not so much to the need for abbreviation, as to the fact that the 
problems arising pertain essentially to matrix algebra and may be solved 
in that framework. 

As usual, a rectangular matrix with v, row and v, columns is called a 
¥, X Vg matrix. A matrix with v,=1(i.e., a column of numbers) is called 
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a vector, and a matrix with v,=1(i.e., a row of numbers) is called a 

transposed vector. If A is a v, x v, matrix, A°® denotes the transposed v, x v, 

matrix with complex conjugate elements. In particular, if a, bare vx 1 

vectors, then a°} is the scalar product of the vectors a and 6, but ba’ is a 

v Xv matrix. We shall denote by / the unit matrix, i.e., the matrix whose 

principal diagonal contains units and whose remaining elements are zeros. 
Introducing the notation x= ||£,||, P = jlaall, ¢ = Ball. @ (0) = llega ©) II, 

fO= lw Ol. c= Ilyall, o = jlo, GA=1,...,v, R= l,...,x%), we write 

system (0.1) in the form 


F=Pr+qe(s)ti(t, comers, (1.1) 


where P is a vx v matrix, g and r are v x x matrices (and, consequently, 
ris a x Xv matrix), x is av x 1 solution vector, /(9 is av x 1 function 
vector (external action), 0, @(e) are x x 1 vectors. 

System (0.2) can be written in a similar form 


S = Px+q9(9), o= rx, (1.2) 


where for system (0.2) x = 1,0 isanumber, and @() is a scalar function. 
We preserve the notation (1.2) also for an arbitrary x>1. 
Equation (0.3) takes the form 


det (P—A/) = 0. (1.3) 


Consider system (0.1) or (1.1) with x>1. Equating the external action 
to zero and separating nonlinear units, we obtain an autonomous linear 
control system (the linear part of system (1.1) or (1.2)) with inputs @ () = 
= |\q, (||, Outputs o (f) = ||o, (N |] and equations dxr/dt=Px+@(\), o@ =x. 
Introducing at the inputg,=0,..., 9).1—= 0.9, =e, Ow =—0,...,9=0, we 
obtain at the output the values o, = — yy (lw), .. . oe = — Aan (iw)e. The 
values of yj, (io) can be measured, the x’? curves ya (fo) for 0 <a < oo are 
called, as is well-known, amplitude-phase response curves of the linear 
part of the system, the x® functions yp (&’) are called transfer functions, and 
the x Xx matrix 


% (A) = |X yu (ADI (1.4) 


is called the transfer matrix of the linear part of the system. For x=1 we 
have one transfer function y(A4)and one amplitude-phase response y (jw). It 
is easily shown that for system (1.1) with arbitrary x 


4A) = 7° (P — AF. (1.5) 


The transfer matrix , (A) is an invariant of the linear substitution x = Sx, 
in system (1.1), i.e., if substitution yields the system dx,/df = P,x, + 9,9 (0,)+ 
+f, (0,0 = rx, then 7° (P—AlN'q =r, (P, —A/)q,. Other definitions of the 
transfer matrix xz (A) are also known. 

It is important to note that most of the results of the present paper are 
formulated in terms of the transfer matrix , (a) (or transfer function, if 
x= 1), i.e., these results can also be used in the frequently encountered 
case when the system of equations (1.1) or (1.2) is not given and only x (A) 
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and certain general characteristics of the nonlinearities @, (c,) and the 
external actions yw, (f are known. 

Let us assume that some class M of nonlinearities @ (c) = ||9,(c,)|{ is given, 
for example, let g, (o,)) be continuous functions satisfying the conditions 


0< 0; (6) <p} (1.6) 


for fixed numbers y,. 

Our main interest is in conditions for absolute stability of the linear part 
of the system (1.2), i.e., conditions under which the solution x =0 of 
system (1.2) is asymptotically stable in the large for any e()GM* We 
shall say that the linear part of system (1.2) or, briefly, system (1.2) is 
uniformly absolutely stable if 


Ix(t)|< 8 (x, t), (1.7) 


where ®@ (x,, t) is some function depending only on the parameters of the 
linear part of the system and the numbers y, in formula (1.6), but not on 
@eM, such that 6 (4, ¢) +0 uniformly in x, for t — oo, when x, varies in any 
bounded domain and @ (x,, f) ~ 0 for |x,|— 0 uniformly in ¢ in every bounded 
subinterval of 10, 00). 

These conditions have the following meaning. In a uniformly absolutely 
stable system the characteristics of interest (i.e., the bound (1.7)) do not 
change for arbitrary variation of the nonlinearities, as long as the latter 
remain in the class M. In an absolutely (but not uniformly) stable system, 
the order of decrease of an arbitrary solution may change when the non- 
linearities vary, but it remains asymptotically stable in the large. 


§ 2. Lur'e's method 


First consider system (1.2) for the principal case and x=1, assuming 
that @ (cd) is a continuous function such that 


9 (c)o>0. (2.1) 
We require a Lyapunov function from the class L, of quadratic forms 


V = x*Hx, (1° = H). (2.2) 


Since 
V = 2x°Hx m — x°Gx + 2x°Hg9, 


where 


G = — PH — HP, (2.3) 


* Asymptotic stability in the large means, as is well-known, that a) any solution x(f)— 0 for ts ee; 
b) the solution x(f) = 0 is asymptotically Lyapunov-stable. 
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we infer, using (2.1), that the required inequality V <0 for x +0 will be 
satisfied if* 


Hqa=—r, G>O. (2.4) 


Let us try to find a matrix H satisfying these conditions by taking G = uu® + e/, 
where uw is some vector and e>0. It is well-known that under our assump- 
tions regarding the roots of equation (1.3) the solution #4 of the matrix 
equation (2.3) (or, in scalar form, of the corresponding system of v (v + 1)/2 
linear nonhomogeneous equations) is uniquely determined, and H >0, if 
G>0. Let us denote this solution by 


H=L/(G). (2.5) 
In our case, by virtue of the linearity of the operator L (G), 


H = L(uu’) + eL (1). (2.6) 
Substituting in (2.4), we obtain 


L(uu")q+r'=0, (2.7) 
where 
ro =r+eLi(/). (2.8) 


The vector equation (2.7) in scalar notation is a system of v quadratic 
equations for the components of the vector u—Lur'e's resolvents. If these 
equations have a real solution uw for all vectors r’ sufficiently close to r, 
there is, in particular, a solutionufor the vector r’defined by formula 
(2.8) with sufficiently small e>0. Determining the matrix H by formula 
(2.6), we find that (2.3) and (2.4) are satisfied with G= uu’ +e/. Since 
G>O0, there exists 5 >0 such that x°Gx > 26x°Hx, for all x. Then 

V= — xGx — 09 (0) < — 26V, i.e., V [x (0) < eV [x (0)). This means that for 
some constant 7 >0, depending only on H{(i.e., on P,q, andr), but not 

on the function 9(@), we have 


1x(Q|< ve | x (0). (2.9) 


The argument remains valid when 9 = q(s6) is an arbitrary function 
depending on x, ¢ and, possibly (provided (2.1) is satisfied), on certain 
parameters. The function 9(s) may be, in particular, a hysteresis function. 

Let us formulate our result. 

Theorem 2.1. Assume that system (1.2), where x =1, corresponds 
to the principal case (all the roots of equation (1.3) lie in the left half-plane), 
and the output of the nonlinear unit 9 = 90) is an arbitrary function of time 
related to the input so by (2.1). Assume that for all vectors r' sufficiently 


* Here and in the sequel thé inequality @ >0 means that @ is a positive-definite matrix, i.e., x°Gx>0 for 
any vector x40. Similarly, G0 implies that #°Gz> 0 for any x. 
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close to r the quadratic vector equation (2.7) has a real solution a, where 
the operator L(G) is defined by (2.3) and (2.5). Then system (1.2) is uni- 
formly absolutely stable, and, moreover, there exist constants 7 >0, 8>0, 
depending on the parameters of the linear part of system (1.2), such that 
for any solution x(t) inequality (2.9) holds. 

Again, let 9(s) be a continuous function, and instead of (2.1) suppose 


0 < 09 (5) < poo”, (He < + 00). (2.10) 
We require a Lyapunov function of the class L of functions of the form 


V = x'Hz + 0 9(0)do, (2.11) 


° 


where the matrix Hand the number @ are the required parameters. After 
elementary calculations we obtain 


—V = x°Gx + 2x°gg + 19? + (6 — Mo') 9, (2.12) 
where G is defined by (2.3) and 
—g=Hq+5(8P'+/)r, ¥ =p5! — Or°q. (2.13) 
For y >0, V can be written in the form 
Var (G—ree 2+ 7119+ 18's |" + (0 — oH0') 9. 


Since (2.10) implies that (@ —gp,.")p>0, V <0 will hold for x0, if 
H and @ satisfy 


1>0, 1G—ge’>0, (2.14) 


or 


7¥=0, g=0, G>0. (2.15) 


The second inequality of (2.14) will be satisfied if we take g= Vu, 
G= uu* + e/, where e>0 is arbitrarily small. Then His found from (2.6): 
as above, we obtain the resolvent oe 


L(uu")q + V tu +4 (OP* + 1)r’ =0, (2.16) 


which must have a solution for all vectors r’ sufficiently close to r. In 
addition, the parameter @ must satisfy 7 =p," — @r°q > 0, det (OP° + J) #0. 
Relations (2.15) lead, as above, to equation (2.16) with 7r=0. 

If equation (2.16) has a real solution u for all vectors r’ sufficiently close 
to the given vector r and for some parameter 6, then V <0 for x #0: and 
hence, as is easily shown, follows the uniform absolute stability in the class 
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of continuous nonlinearities satisfying condition (2.10). * Equation (2.16) 
(in a somewhat different form) was derived by Rozenvasser /4/. Equation 
(2.7) results from (2.16) by putting p, = o, and @=0. 

We supplement the above results by a few remarks. One could take «=0 
in the above argument. Then, instead of the ''varied'' Lur'e resolvents 
(2.7) and (2.16), 'unvaried'' equations with r’ = r would be obtained. However, 
the existence of a real solution of these equations ensures only the inequality 
V<0, and therefore stability in the large may be deduced only under 
additional assumptions. 

As indicated above, the solvability conditions of equations (2.7) and (2.16) 
for v <4 (and fixed 6) are expressed in terms of the coefficients of these 
equations in /1, 2,5/. These conditions can also be found as follows. We 
determine polynomials a (A) and A (A) from the relation p.-! + (1 + A®)x (A) = 
=a (d)/A (A), where, we recall, (4) = 7° (P —AN-q. 

Therefore, A (A) = det (P —A/). In order that equation (2.16), where y>0, 
have a real solution a fnr all r'sufficiently close to r, it is necessary and 
sufficient that there exist a polynomial & (A) with real coefficients which does 
not have purely imaginary zeros and which satisfies the identity 


a (A) A (— A) + a(—A)A(A) = (A) E(—A). (2.17) 


For 1=0 the additional condition lim o'a(iw/A(io)>0 must hold. 


Equations (2.16) and (2.17) remain valid in the general critical case as 
well, if certain additional conditions are satisfied /25/. 

In concluding this section we note that the above version of Lur'e's 
method may be carried over almost unchanged to the case of many non- 
linearities, i.e., x >1. 

The proof of Theorem 2.1 remains the same word for word, except that 
uis avy Xx matrix. The matrix equation (2.7), in scalar notation, will be 
a system of vx quadratic equations for the elements of the matrix ua. 

For x > 1 one should consider, instead of (2.11), a function 


ai 


V=x°Hx +2 ® 9; (6) doy. (2.18) 


Then, instead of (2.16), we obtain the equation (taking g = u) 
L(uya’)a+u+ 5 (Pr'0,+ r’t,)=0, 7 Tybgt — O4°°9. (2.19) 


where ty and 6, are diagonal matrices with diagonal elements to be chosen 
respectively, +, >0,...,+t.>>0, 0,....6, andpyis a diagonal « x * matrix 
with diagonal elements p,;,....,#.. Conditions for solvability of equations 
(2.19) are known, however, only in a few particular cases. 


° Now, however, this conclusion fs not necessarily true for nonlinearities of a more genera! class than that of 
continuous functions satisfying (2.10). In this connection, the narrower class Le of Lyapunov functions (2.1) 
has some advantage over the class L of Lyapunov functions (2.11); it is useful to bear this in mind in the 
sequel. 
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§ 3. Special matrix inequalities and their solution 


The problem of constructing a Lyapunov function of class L, (see (2.2)) 
or L (see (2.11)) was reduced in § 2 to the following algebraic problems, 
these are more conveniently formulated in somewhat different notations. 

Given: a real Hurwitz v xX v matrix A(the equation (A — 4) = 0 has 
all its roots in the open left half plane), v x 1 real vectors a, 6 and a number 
ep >0.* Using a matrix H=H we define 


G=—(AH+HA), g=—(Ha+6). (3.1) 


It is required to indicate conditions for the existence of a real symmetric 
matrix H such that either 


eG — gg’ >0, (3.2) 
or 
G>0, g=9, (3.3) 
or 
=({% ®\ So 
g=(7.2)> (3.4) 


where in (3.4) we may have p = 0. ** 

We note that in cases (3.2) and (3.3) G>0 and therefore, necessarily, 
H>O0, in accordance with Lyapunov's well-known theorem. 

In the case (3.4), G>0 and, consequently, H>0. Since the Lyapunov 
function must be positive-definite, in this latter case we also need conditions 
for the inequality H > 0 to hold. 

The solution of these algebraic problems is given by the following 
theorems. 

Theorem 3.1. For the existence of a solution H = H* of inequality (3.2) 
it is necessary and sufficient that for all w in the interval 0<@ < o: 


0+ 2Reb°(A —io/)*a>0. (3.5) 


Theorem 3.2. For the existence of a matrix H = H* satisfying (3.3), 
it is necessary and sufficient that: 


Reb’(A—0l)a>0 for 0<e< oo, (3.6) 
lim w* Reb°(A —do/)?2 a > 0. (3.7) 


* The complex case is considered analogously. 
°° The problem of constructing a Lyapunov function of class Le(p = 0 or L (p>>0), such that V <0, leads to 
inequality (3.4). 


37 


GANT MAKHER and YAKUBOVICH 


Theorem 3.3. a) For the existence of a matrix H =H’ satisfying (3.4), 
it ts necessary that 


p+2Reb°(A—iol)"a>0, for 0S w< oo. (3.8) 


b) If the rank of the matrix 
fa, Aa,..., A” aj (3.9) 


ts v, then condition (3.8) is also sufficient. c) The null-space of the 
matrix H ts contained ina subspace which ts orthogonal to that spanned by 
the column vectors of the matrix 


yo, A°b,... A "Oh. (3.10) 


If, in particular, the rank of the matrix (3.10) is v, then H>0. 
d) If (3.8) is satisfied and the rank of the matrix (3.9) is v, then there 
exists a matrix H = H* such that 


G=uu", g=uVp, (3.11) 


where u is some vector. 

We note that (3.11) obviously implies inequality (3.4). Forp>0, 
relations (3.11) can be written in the form pG — gg*® = 0, which indicates their 
relation to condition (3.2). 

Theorems 3.1 and 3.2 are proved in /8/. The proof of part a) of 
Theorem 3.3 is a word for word repetition of the corresponding reasoning 
in /8/. Parts b) and c) can be derived from Theorem 3.1 by passing to the 
limit. A proof of parts b), c) and d) of Theorem 3.3, independent of /8/, 
has been given by Kalman /6/. 

We give a proof of Theorem 3.2 based on the arguments in /6, 8/; to 
simplify matters we assume that the rank of the matrix (3.9) is v. We 
denote Aj = A — iol, a, = (A — iw/y1a. Assuming the required matrix H to 
exist, we have (@ being real)- G=A.H + HAs. Multiplying this relation on 
the left by a. and on the right by a, and substituting Ha = —b (which follows 
from g=0), we obtain 


2Reb* (A — iw!) a = a. Gag. (3.12) 
Hence 
b°Aa = lim wo Reb? (A — iol) a = 5 0°Ga. (3.13) 


Formulas (3.12) and (3.13) imply the necessity of conditions (3.6) and (3.7). 
Now suppose (3.6) and (3.7) are satisfied. We show that a matrix H — H° 
exists which satisfies (3.1) and (3.3). 
We show first that (3.3) follows from (3.12). Substituting G = — (AH + HAs) 
in (3.12) we obtain 


Re(Ha + 6) (A—iol)a= 0. (3.14) 
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The function » (A) = (Ha + b)* (A —A/)-'a is holomorphic in the left half-plane, 
and Rew (io) =0. By the principle of analytic continuation » (A) =—¥ (—A), 
i.e., (A) is holomorphic everywhere, and consequently yp (A) = const. 
Asymptotic considerations for |A| — oo imply that » (A) =0, and by 

virtue of the linear independence of the columns of the matrix (3.9), 

g= —(Ha+6)=0. Thus, it is sufficient to find a matrix G>0 satisfying (3.12) 
(then H will be determined from the first relation of (3.1)). We will look 
for G in the form G = uu’+ e/, where e >0. Substituting in (3.12) we obtain 
we obtain 


2Reb°a, — eaua = |u°a, |*. (3.15) 


From (3.6) and (3.7) it follows that for sufficiently small e> 0 the left- 
hand side of (3.15) is positive for all wo >0. Let A (A) = det (A —A/), 


(A —AN"a = ox. gQ)=a%t+AGqQ,+..- tA" q-,; The condition on the matrix 
(3.9) implies the linear independence of the vectors q, 4.,..., 9v-1. The 
equality (3.15) can be written in the form 8 (ia) = |u°g (i) |*, 


B (io) = | u"g (iw) |?, (3.16) 


where § (A) is a polynomial in A* with real coefficients, defined by the: 
relation B (io) = | A (iw)!*- [2Re 6°a, — eaua.). Since B (io) >0 for wo >0, the 
roots of the equation # (A) = 0 are symmetrical with respect to both the real 
axis and the imaginary axis. Therefore there exists a polynomial § (A) with 
real coefficients which satisfies the identity 


B(A) =&(A)E(—A). (3.17) 


For § (A) we take a product, with suitable coefficient, of the monomials 
(A —A,), where A, are the roots of the equation 6 (A) = 0 in the left half-plane. 
The resulting polynomial §£(A)=&+A8&, +...+A*'§&, is used to determine 
the vector u from the relation & (A) = u’g (A), i.e., from the relations 
t; = ug, ( = 0,1,...,¥—1). This is possible by virtue of the linear 
independence of the vectors gq. When this is done (3.16), and consequently 
(3.15) and (3.11) are satisfied, which completes the proof. 

Theorems 3.1 and 3.3 can be proved similarly. 

In the case of many nonlinearities, problems of constructing Lyapunov 


a @ 
functions of the form V = xHx+ >) ,\ g; do; lead to analogous matrix 

i=} 9 
inequalities, in which now a and 6 are v Xx matrices, andp>Oisa xx x~- 
symmetric matrix. Instead of (3.2) one considers the inequality 


G— go 'g* >0. (3.18) 


Let us introduce the notation Re Z = 3(Z + 2°), where Z is an arbitrary 


complex matrix. The following assertion can be proved. 

Theorem 3.4. Using the above notation, the following remain valid: 
Theorem 3.1 with (3.2) replaced by (3.8), ‘Theorem 3.2 with the condition 
that rank a=x, and Theorem 3.3, where u is a matrix of order v xx; in 
(3.5), (3.6) and (3.8) —w<uw< + oo. 
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In this case the left-hand sides of inequalities (3.5) —(3.8) are xxx 
matrices, and the matrices (3.9) and (3.10) are of order vxvx. 

The problem of necessary and sufficient conditions for the existence of 
a Lyapunov function of class L for system (1.2) with x=1, where @ (oc) 
satisfies conditions (0.3), leads to the following matrix problem. 

Given A, a, b,c, 7, where Ais a vxv real Hurwitz matrix, a, 6, care real 
vx 1 vectors and + is a real number, we determine g and G by formulas 
(3.1) and set F (pn, H)= G+up (gc +cg’)+ tTp%a’. It is required to find existence 
conditions for a matrix H=H’such that 


F(p, H)>0 for 0< p< pio, (3.19) 


where pn, is a given number (for the problem indicated above A = P, a= gq, 
b= 5 OPr’, c=r,7= —?rq.) 


Theorem 3.5. For the existence of a matrix H = H* salisfying (3.19), 
tt is necessary and sufficient that for some «>0andallw>0 


1% + 2Reb°(A— iw!) a + t (ps! + Rec’ (A — iw!) a} > 0. 


The proof is given in /9/, where the case of the weak inequality 
F (yp, H) > 0 is also considered. 

It is important to note that the problems considered above of the necessary 
and sufficient conditions for the existence of Lyapunov functions of type (2.1) 
and (2.11) in the problem of the absolute stability of systems (1.2) are not 
the only ones leading to the matrix inequalities (3.2)—(3.4), (3.19); the 
same is true of various other problems considered below for systems (1.1) 
and (1.2). 


§ 4. Popov's frequency condition 


Consider system (1.2) for x = 1, where y (s) is a continuous function 
satisfying relations (0.4). Applying Theorems 3.1 and 3.2 to the matrix 
inequalities (2.14) and (2.15), we obtain the following sufficient (and 
necessary) conditions for the existence of a solution H = H”’: 


n(o) = pz! + Rel(] + iw?) x (ia)) >0 for 0< woo, (4.1) 
+00 > lim w*n (w) > 0. (4.2) 


If (4.1) holds, inequality (4.2) canbe satisfied, as shownin /3/ (pp. 85-86), 
by a sufficiently small variation of the parameter @. We arrive at the 
following result (for more details see /26, 47/): 

Theorem 4.1. (Popov's criterion). In system (1.2), let P bea 
Hurwitz matrix and let (4.1) be satisfied for some parameter ®. Then the 
linear part of system (1.2) is uniformly absolutely stable for the class of 
continuous functions 9 (0) satisfying (0.4). 
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As mentioned in the introduction, this theorem* was obtained by Popov 
/12/ in a different way. Let us describe his method, assuming for 
simplicity that instead of (0.4) we have the stronger conditions 


19(9)| <P im, 9< 69 (3) < pos? (Ho S +00). (4.3) 


On the basis of this condition we may assume that instead of (4.1) we have 
the somewhat weaker condition 


Hs' + Re[(1 + iwd) x (io)) >0 for 0g a< oo. (4.4) 


We shall show that under these assumptions any solution of system (1.2) 
tends to zero for t+ oo. We give the proof for 6 >0 (the case 6<0 is 
analogous but slightly more complicated). 

From (1.2) we have 


t 
x(t) = efx (0) + Ve? (99 [0 (t)] de, (4.5) 


t 
a(t) = o9(¢) +) Q(¢—+) gla(x)} de, (4.6) 


where 4, (f) = r°e?x (0), Q (t) = r°eP"a. 
There exist 7 >0 and e>0 such that |e"*| C ve“. From (4.5) it follows 
that 


é 
[x (0) < re**} x(0)] + | re* 91g] 19(0) | de, (4.7) 


Ix()1< ret] (0) + 71919, 8 (4.8) 


i.e., x( and o (f) = rx (tf) are bounded functions of time. We shall show that 
o (f)} 0 for t-—oo. Then @¢ [co ())] 0 (@ () is a continuous function) and from 
(4.7) it will follow by l'Hopital's rule that |x ()}|— 0 for ¢ +0. 

Let us denote 


g, (t) = g[o(t)] for 0Rt<7T,9,=0 fort>T or t<0, (4.9) 
e 


E(t) = (Q¢t—+) 9, (x) de. (4.10) 
® 


Here T > 0 is for the time being arbitrary. Consider the integral 


ows e 
J,= ( ((& —9,H5") 9, + SE9,) de. (4.11) 


—o 


* To be precise, a similar but somewhat weaker theorem. Condition (4.1) can be weakened (replacing > 
by >) by imposing certain additional restrictions. 
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By Parseval's equation 


-+-co 
Je= J (E—Fy3") H+ 0 (io) FH) do, (4.12) 


=~ 


‘where E. er are the Fourier transforms of the functions §&, 9,, and the star 
denotes the complex conjugate. It follows from (4.10) that f=—y~(io)g;, 
where x (A) is the transfer function (1.5). Substituting the value of — in 
(4.12) we obtain (using the fact that J; is real) 


i 
ce ( Re[pe! + (1 + iw6) x (im)}-| 9, do. (4.13) 


— 


It follows from (4.14) that J;-<0. Substituting § (9 =o (f) —o, (6 in (4.11) and 
+00 T 

replacing \ by \ and 9; by 9, (which is permissible by virtue of (4.19)), 
-2 e 


we obtain . 


Je =o —en3)@ + BOP + (00 + B6o) 9) dt. (4.14) 


From (4.3) we conclude that »() = lo— 9 (c}t,-'p («) > 0 fora #0. Since 
J>< 0,050, it follows from (4.14) that 


T -) @ (0) 
{ vle(ide<{ |or+ Oay|dt-9,, + 0 ' qds (4.15) 


and since T >0 is arbitrary, the integral ( (s(t)) df converges. The 
° 
function o () = r° (Px + gp) is bounded. If we had o( «0 for ¢ oo, then 
for certain t, ~oo and 6 >0 we should have |o (,)| > 26, and consequently 
also jo (| <6 in the intervals (¢, —¢e,f, +e), where e>0 is some number 
@ 
determined by sup|o|. Then \» lo (t)]df = oo. This contradiction shows that 
6 
o ()} - 0 for t-— oo, and the proof is complete. 
We note that the estimate (4.15) easily yields absolute stability in the 
class of nonlinearities satisfying (4.3). 
The frequency conditions (4.1) or (4.4) have a simple geometrical 
interpretation. Put 


§ (w) = Rex (io), n(@) = olm x (iw) (4.16) 
and construct in the (§,n} plane the curve § = £ (), 7 = (), which, following 


Popov, we shall call the modified frequency response. The curve § = § (@), 
71 = 17 @) is obviously obtained from the frequency response x (iw) by 
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multiplying the ordinates by o. From (1.5) it follows that 
E (@) = °(P? + wf) Pq, 1 (@) = wr" (P? + oJ)" 9. (4.17) 


In the (§, n} plane draw a straight line (Popov's line) such that: a) the 
modified frequency response is to the right of this line, b) the abscissa Ep 
of the point of intersection of this line with the axis n =0 is nonpositive, 

c) the value of {)is the maximum possible (Popov's line may obviously 
contain points in common with the modified frequency response (see 

Figure 1)). We set p, = —&) if & +0, and pj =o if; =0. Popov'scriterion 
(condition (4.1)) states that there is absolute stability in the class of 
nonlinearities satisfying 0<9@(o)/o <p, <p;, ifpj-o, or poo >0, if 

; = 00. Condition (4.4) is interpreted analogously. 


n 


The connection between the Popov and Nyquist criteria is easily 
established. We denote by {, the abscissa of the extreme left point of inter- 
section of the modified response with the axis yn = 0if this abscissa is 
nonpositive, otherwise &,=0(see Figure 1). The value of £, is obviously 
unchanged if the modified response is replaced by the frequency response 
y (“#). Therefore, according to the Nyquist criterion, the linear system 
obtained from (1.2) for g (¢) =powill be stable for 0G p<Cpn=E;,, if En#O, 
or for all p>0, if &)=0. It may happen that &:)= €; (see Figure 1b). 
Aizerman's well-known problem is relevant for systems of this type. 

Popov's criterion is thus very simple to apply and very intuitive. 

We mention another important fact. The above proof of condition (4.4), 
which does not use the matrix inequalities of § 3, remains unchanged when 
the system is described by equation (4.6), where the function Q (4 is an 
arbitrary continuously differentiable function satisfying the condition 


IQ’) +(QOl<re*, (4.18) 


where 7+~>0,¢>0, or some other less restrictive condition. The above 
assertions are made more precise by the following theorem /18/. 

Theorem 4.2. Suppose the transfer function x (p) ts analytic in the 
half-plane Re p > —e, where e>0, and a, (t), Q(t) ave continuously differen- 
tiable and satisfy inequality (4.18), and |«o(t)| +|0(t)|< roe. 
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Then if Popov's condition (4.1) is satisfied, the system described by 
equation (4.6) is absolutely stable for the class of continuous nonlinearities 
@ (c) satisfying 0 < og (c) <_p,07 (uy < 00), im the sense that lim o (t) = 0 and 
sup 19 ()|-+ 0 for 1, +0. 7 


Theorem (4.2) may be applied to systems more general than (1.2), for 
example, to systems with a delay of the form 


A 
¢ = 5} Px (t—%)) +99(9), = rx, (4.19) 
j= 


where +;>0, and others. In such cases it is sufficient to know only the 
frequency response X (/#) of the linear part of the system. In the case of 
system (4.19) we obviously have 


x0) =0°(5} Pe — al) 9. 


/=1 


Theorems 4.1 and 4.2 may also be extended to the case of many non- 
linearities. The following generalization of Theorem (4.1) is valid in this 
case. 

Theorem 4.3. (Popov's criterion for the case of many nonlinearities). 
Consider system (1.2) for» >1, where P is a Hurwitz matrix, and the 
continuous functions 9,(0,) satisfy (1.6), where w,;s«:00. Assume that for some 


x xx diagonal matrices +4, 04g with diagonal elements z,>0,...,%:>0, 
6,,...,0., respectively, we have 
% (w) = Tyg! + 2Re (ty + i@Oy) x (iw) > 0, (— 0 <o< + 0) (4.20) 
lim on (w) >0, (4.21) 
where py is a x xx diagonal matrix with diagonal elements y,,..., x. Then, 


a) the linear part of system (1.2) is absolutely stable for the indicated 
class of nonlinearities; b) a Lyaupunov function can be constructed for 
system (1.2), in the form 


a bd | 
V = x°Hx+ >} 0, ( , (6,) doy. (4.22) 
j=i rs 


Part a) was proved by Popov /12/ for the case in which (1.6) is replaced 
by ejo7< 9,9, (0/)< (H) —&)) 9} « 

For x = 1 onemaytake tg= 1 without loss of generality, and then (4.20) 
coincides with (4.1). Other similar formulations of Popov's condition are 
known in which, under certain additional assumptions, the signs > are 
replaced by >(and, naturally, condition (4.21) is eliminated and the 
elements +t, of the matrix tg may be zero) and, consequently, the parameter 
® in condition (4.1) may take the value too, etc. 

Popov has also considered the simple critical case. The formulation of 
Theorem (4.1) does not change, except that (0.4) is replaced by (0.5) and the 
necessary condition for absolute stability 


I =Resx(\)>0, 
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due to Lur'e, is added. In /25/, Popov's criterion is extended to the general 
critical case. Let us formulate the result of /25/ for the case of two purely 
imaginary roots. 

Theorem 4.4. Consider the system (1.2) with x = 1 and assume that 
equation (1.3) has two purely imaginary roots + i #0 and that its remain- 
ing roots lie in the left half plane. Represent the function x (A) (see (1.5)) in 


the form % (A) = 1A) + ts where %: (A) is holomorphic on the imaginary 


axis. For absolute stability of system (1.2) in the class of continuous 
functions 9 (0) satisfying (0.5), it is sufficient that a>0 and 


eee: | poy. ,. 
n (wo) = po! + ae Re[(! +i aor) @()|>0 (4.23) 
for allow >0, if limn @) = 0; in addition, 


lim wx (w) > 0. 


We note that n @) can be represented in the form 
K(o) = yt + Re [(1 +: *) x (io)]. 
AW, 


The inequality a> 0 is a necessary condition for absolute stability. 


$ 5. Solution of Lur'e's Problem. S-procedure 


Let us return to the existence problem of a Lyapunov function of class L 
(i.e., of the form (2.11)) for system (1.2), for the principal case and x= 1, 
with an eye to the complete solution, i.e., we require not only sufficient 
but also necessary conditions for the existence of such a function. 

Examining the proof of Theorem 3.2 (§ 3), one easily sees that when 
class L, is considered instead of class L (see (2.2)) and p, = co Lur'e's 
method gives not only sufficient but also necessary conditions. In fact, 
it is easily shown that condition (2.4) is not only sufficient but also necessary, 
and the proof of Theorem (3.2) implies that, without loss of generality, the 
matrix G may be assumed of the form G = uu’ +e /. 

In the general case a number of difficulties arise. For simplicity we 
confine ourselves to the case p, #00. The derivative of the function (2.11) 
has the form 


—V = 2'Gx + 2x"gop + ToF*, (5.1) 


where 9=9(s), gor— (Hq + 4 OP's) > To=—POr'q. . 


It is natural to consider the following problems. It is required to find 
existence conditions for a matrix H = H* such that (1)V << Ofor x0, where 
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@=9@ (c) is a fixed function satisfying (0.4) and og = rx, (I)V <0 for x+#0 for 
any function @ = @ (c) satisfying (0.4), (III) V is a negative definite form in 
the independent variables x, @, where @ and oa are related by (0.4) or by 


(o— gps) o>0. (5.2) 


It is obvious that problems (II) and (III) coincide. If we assume that in 
problem (I) Inf@ (o)/c = 0 and sup@ (o)/o = p, (where the Inf and sup are attained 
for po # 0), then, as is easily checked, problem (I) is also equivalent to 
(II) and (III). One may thus consider problem (III), which is, however, 
fairly complicated. 

To simplify the problem, we consider a special method (the S-procedure), 
first applied by Lure's /1/ and Rozenvasser /4/, which has also been found 
useful in a more general situation. The S-procedure for the general case 
is as follows. Let us assume that ® (y) is a function of the vector argument 
y depending on certain parameters, conditions relating which must be found 
to ensure ® (y) > 0 in the domain determined by the relations 


yO, 2 (y)>0, (J=1,.-5 &) (5.3) 


where 2; (y) are certain functions depending on the same parameters. Let 
us set up the function 


a 


S(y) = O(y) — JY 2)(y) 


font 


(where t,>0 are certain additional parameters) and require that, for 
values of the parameters t,;, >0 and all y #0, S(y) >0 be satisfied. Then, 
obviously, ® (y) >0 in the domain (5.3). The procedure thus reduces the 
problem of conditions that the function be positive-definite in the domain 
(5.3) to the analogous problem for all y +0, which is usually simpler. In 
this case, however, simple examples show that the required domain in the 
parameter space may be larger than the domain obtained by the S-procedure. 
Nevertheless, it will follow from the sequel that these domains coincide in 
the case under consideration. * In our case y=Ix, 91, & = 1, Q, = @ (7x — qpz) 
and, without loss of generality, t,= 1. Application of the S-procedure leads 
to the question of conditions for positive-definiteness of the expression in 
square brackets in (2.12). A solution of this problem is provided by 
Theorem 3.1 and leads to Popov's condition (4.1). 

From part 2) of Theorem 3.3 it follows that if in addition the vectors 


q, Pq,...,P’"g (5.4) 


are linearly independent, a complete solution of the problem is also given 
by Lur'e's method, which leads to the resolvents (2.16). 

If the S-procedure is not applied, the problem of positive-definiteness 
of the function (5.1) leads, putting @ =ypo, to the problem of existence 


* It would be important to find a general characterization of the classes of functions for which these domains 
coincide, i.e., of the classes for which the S-procedure does not “worsen” the result. For & = | and 
quadratic forms the coincidence of these region follows from /9/. 
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conditions for a matrix H = HH’ such that 


; x°(G + (gor® + 7g) B+ Torr'p*} x >0 


for all x#0and 0<p<p,. A Solution of this problem is provided by 
Theorem 3.5, which yields a necessary and sufficient condition coinciding 
with (4.1). 

Let us formulate the result /9/. 

Theorem 5.1. Assume that the vectors (5.4) are linearly independent. 
The following assertions are equivalent: 

a) There exists a Lyapunov function of class L for system (1.2) which 
ensures absolute stability, such that V<0 for «+0. 

b) Popov's frequency condition (4.1) is satisfied for some value of the 
parameter ®. 

c) Lur'’e's resolvents (2.16) have a real solution ® for some value of 
the parameter u. 

As indicated, an analogous statement holds in the simple /3, 6/ and 
general critical /9, 25/ cases. An analogous statement is true also when 
the sign >in (4.1) is replaced by > and the values @= +00 are permitted, 
and for the Lyapunov function V < 0. 


§ 6. Absolute stability with given damping index 


We require conditions on the linear part of system (1.2) which ensure 
that 


| x(t) |< Be-*/] x(0)], (6.1) 


where a> 0 is given; the constants a and B depend only on the parameters 
of the linear part of the system and the numbers yp, in formula (1.6). Thus 
the numbers aand f must not depend on the nonlinearities gq, (o,) or on x (0). 
When (6.1) is satisfied, we shall say that system (1.2) is exponentially 
absolutely stable with damping index a. 

Consider the case x = 1 and suppose that poxeoo and that the rank of each 
of the matrices 


1g, Pq,..-. P°'gh, Ir, P'r,..-, PP2Y (6.2) 


is v.* Since we may take @ () =0 in (1.2), the roots of equation (1.3) must 
be assumed to lie in the half-plane Rea< — a, and thatthe simple elementary 
divisors of the matrix P are the roots on the line Rea = —a. Instead let us 
assume that the roots of equation (1.3) lie in the half-plane REeA< —a. 

Theorem 6.1. Under the above assumptions, it is sufficient for 
exponential absolute stability with a damping index that for some and all 
o> 0 


Me (©) =a ps? + Re [(1 + iwd) x(—a+ dw)) SO. (6.3) 


* It is easily shown that this condition is equivalent to nondegeneracy of the function x(a), which means that 
x(A) cannot be expressed as the quotient of two polynomials where the denominator is of degree less than v. 
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A similar statement, not assuming the conditions regarding the matrices 
(6.2), is proved in /32/; the sign > in (6.3) is replaced by > and the 
condition +00 > limo* x(a) >0 is added. In this case a may be replaced by 


a+e with sufficiently small e>0 in (6.1). Theorem 6.1 is proved in 
exactly the same way, using Theorem 3.3 instead of Theorem 3.1 
Inequality (6.3) has the same geometrical interpretation as Popov's 
criterion (see § 4). 
There is an analogous statement for x >1. Repeating the argument of 
/32/, we see that Theorem 6.1 remains valid also for x > 1 if condition 
(6.3) is replaced by 


Tug + Re l(t, + swd,) x(—a + fw)) > 0, (—o<w< +00) (6.4) 


where ty and 6, are diagonal matrices with diagonal elements +, >0,..., 
t, >0,98,,...,0, respectively, and all the numbers @, have the same signs 
(i.e. : 8,0: > 0 ). 


§ 7. Systems with discontinuous nonlinearities. 
Absolute stability of the stationary set 


Consider system (1.2) with x = 1 for simplicity, where 9 (0) is a piecewise 
continuous function, such that for each discontinuity point o, there are also 
finite limits 


G_ (9) = lim p(s), , (6e) = lim (6). (7.1) 
o<e, eee 


We retain the notation 9, (o.) and @_ (o,) also for points of continuity; then 
©. (9.) = 9. (6.) = @ (0.). Taking into account the possibility of sliding motion, 
we arrive at the following definition of the solution: a solution of the system 
(1.2) is any function x (, such that for some function » (f)* 


Sf = Px + qv (ty, 
go < (1) <e, [o(t)}, oft) =r'x. 


(7.2) 


We shall call the function » (9) the complementary function of @ {fo (f). It 
is obvious that for a continuous functiong (a), this definition agrees with 
the usual one. If @(c)satisfies condition (0.4), then 


0 0(t) 9 (4) << H06 (#)? (Ho< + 00). (7.3) 


We note that all the results of the present section remain valid for any other 
definition of the solution, when, for example, equations (1.2) are supplemented 
by the equations for the sliding motion /49/, provided (7.3) is satisfied for 
the complementary function. . 


* The function x(f must be absolutely continuous, and y(f) locally Lebesgue-integrable. Relations (7.2) must 
hold almost everywhere, This definition of the solution {s in the sense of A.F. Filippov /48/. 
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Examining the proof of Theorem (4.1), which is based on the results of 
§ 3, we see that it carries over with slight changes, if 9 [o (J is replaced 
everywhere by » (/)(see /24, 25/). 

Theorem 7.1. The assertion of Theorem 4.1 remains valid for the 
above class of discontinuous functions 9 (0). 

A similar theorem was established by Tsypkin /15/, by a different 
method. 

The argument of § 4 leading to condition (4.4) make essential use of the 
continuity of g (o). This is not accidental: in the case of a discontinuous 
function, condition (4.4) does not guarantee absolute stability. In fact, if 
o = 0 is a discontinuity point of the function 9 (co) and x (0) = 0, the system 
(1.2) has a ''stationary interval’ 


x=—P'¢-9, pele_(0), 9, (0)) (7.4) 


(for the stationary solution (7.4) the complementary function ~ (9 =). Thus 
the solution x = 0 is not absolutely stable, although (4.4) may be satisfied. 
Nevertheless, in this case one may define absolute stability of the stationary 
set (7.4). 

Various types of stability of the stationary set are possible: an arbitrary 
solution may tend to some point of the stationary set, or may converge to 
the stationary set in the sense of approaching it without limit, while not 
tending to any of its points. Let us give the rigorous definitions /30/. 

Consider the system 


& =] (x), (7.5) 


where x and f(x) are v-dimensional real vectors, and / (x) is measurable 
and bounded in any finite part of the space {x}. A solution will be understood 
in the sense of Filippov /48/ and the stationary set (the set of equilibrium 
positions) will be assumed bounded, connected and closed. 

A stationary set A is called stable in the large, if it is Lyapunov-stable 
"in the small" and 


p (x(t), A} = min|x( —z|— 0, for too. 
2A 


If Ais stable in the large and, in addition, when ¢ — co each motion zx (f 
tends to a definite equilibrium position z@A, we shall say that the 
stationary set A is pointwise stable in the large. 

Gelig has established the following lemma of the Lyapunov type: 

Lemma 1 /30/. Suppose that for any ceGA and any vector x there 
exists a continuous function V (x, c) satisfying the following conditions: 

(I) V(x,c)>0 for xGéA, lim V (x,c) = 0 and Jim (x, c) = oo; 


(I) there exists an e-neighborhood of the set A, in which for anycGA 
the function V (x, c) does not increase along any trajectory of system (7.5); 

(II) there existse,GA, Such that V (x,c¢) does not increase along any 
trajectory, and on any surface V (x, ¢) = const there are no other positive 
semti-trajectories of system (7.5), apart from the equilibrium positions. 

Then the stationary set A of the system (7.5) is stable in the large. 


49 


GANT MAKHER and YAKUBOVICH 


If in addition to conditions (1h—(IIl the function V (x, c) has the property 

(IV) Vi(x,c=0 for any cEA only for x=c, then the stationary set 
ts pointwise stable in the large. 

Application of this lemma to system (1.2) using the matrix inequality 
theorems of § 3 yields the following result /30/. 

Theorem 7.2. Suppose x= 1, the principle case obtains and the 
following conditions are satisfied: 

(1) The function q (0) has a discontinuity of the first kind at o=0, at 
continuity points it satisfies inequality (2.1), and there exists e>0 such 
that 9 (0) > 9, 0) for0<o<e, and @ (0) <9. (0) for —ex<a<0; 

(ID x (0) =90, x (io) £ 0 for wo + 0; 

(II) the rank of each of the matrices (6.2) is 4; 

(IV) there exists 6>0 such that for all o>0 


Re ((1 + iwd) x (iw)) > 0 (7.6) 


and if ®>0 the number — 6" is not an eigenvalue of the matrix P. 

Then the stationary interval (7.4) is pointwise stable in the large. 

In /25/ the general critical case is considered, when x =0 is the only 
stationary state. 

The research of Gelig and Komarnitskaya, soon to be published, deals 
with the general critical case in the presence of a stationary interval. 

The case x > 1 is considered analogously. Theorem 4.3 is also valid 
without alteration for the case of discontinuous nonlinearities. 


§ 8. Systems with hysteresis nonlinearities 


Consider the system 


& = Px + elo, Golt, o=rx (8.1) 


in the principal case and, first, for x=1. Here @ [c,9,); is a hysteresis 
function. The value of 9 lc,9,}; is the output of the nonlinear unit at the 
instant ¢ in response to the input o (t),0 <1 < ¢; it depends, in addition, on 
the state @ of the nonlinear unit at the initial instant¢=0. More formally, 
let us assume: 1) with each o,, —co <0,< +00 we associate a set E [o,) of 
"initial values of the hysteresis function’; 2) with each¢>0, each function 
o (t) continuous in (0, t) and any 9g © E [oc (0)]) we associate a number 4 lo, 9); 
3) @ lo, q,), =@- We shall then say thatthe hysteresis function 9 lo,@) is 
given. If the number @ lo, @,|: is a continuous function of the two numerical 
arguments {and o (f), we shall say that the hysteresis function is continuous. 
The simplest example of a continuous hysteresis function is provided by 
backlash (Figure 2). Hereo (Nis the abscissa of the point A, E [o,) ={0,, o, + 4), 
and @ lo,q Jr is the abscissa of the point 8. Figures 3 and 4 give with the 
usual interpretation graphs of some continuous hysteresis function. More 
general definitions are possible, according to which 9 lo,q],; is in general a 
discontinuous hysteresis function. In this case the function @ lo, @,) in (8.1) 
is a complementary function, in analogy with the case of ordinary 
discontinuous functions (see § 7). 
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The vector hysteresis function is defined in a similar manner; in this 
case 9 [o, Qolr = [le (67. MIke is a x X 1 vector. We shall assume that for each 
scalar function 9,lo;, 9], we have 


0< 6, (t), 157, Py], (0)? fH... ® (8.2) 


for any 4>0. This condition means, in descriptive terms, that the graph 
of the hysteresis function of 9, lo,, Qj, while generally varying in time, 
always remains in a fixed region of the (o,q9} plane between the lines o = 0 
and g=yp,o. The backlash (see Figure 2) does not satisfy condition (8.2) 
with p, =p, whereas the functions in Figures 3 and 4 do. 

Theorem 8.1. Suppose that for some diagonal x x x matrix +, with 
positive diagonal elements and for allw >0* 


TyHg’ + Reltyx (i@)] >0, (—co<o<-+ oo) (8.3) 


where p,is a@ x x x diagonal matrix with diagonal elements p,,....p.. Then 
the linear part of the system (8.1) ts exponentially absolutely stable for the 
class of nonlinearities satisfying relations (8.2), i.e., there exist constants 
e>0,71>0 depending only on P,q,1,p;,..-.pu, Such that for any solution of 
system (8.1) and any t > 1, 


[x (OLS ve-# ¢-40 | x (ty) |. (8.4) 


Proof. For the function V (x) = x°Hx, where H = H* is the Poazerce 
matrix, we have V = 2x°H (Px + qq). 
Here g=9 1o,9,). Applying the S-procedure, we teanstorn: V by 


—V = [x°Gx + 2x gp + ot p39) + Q, (8.5) 
where 


22 p> P19; — 9B"), B= —(Ha +1rt,), 6G =—(P'H + HP). 
=i 


Since t;>0, we have from (8.2) that Q>0. By Theorems 3.4 and 3.1 it 
follows from inequality (8.3) that there exists a matrix H = H*such that the 
expression in square brackets in (8.5) is a positive-definite form in x and 
@. In this case G>0O implies H>0. From (8.5) we have, for some e>0, 


—V <2 (8.6) 


Integration of (8.6) yields (8.4). 

The above proof, as well as Theorem 8.1 may be refined at a number of 
points (under certain additional conditions, the sign >in (8.3) may be 
replaced by >, t and p, may take the values 0 and oo respectively). For 
x = 1 condition (8.3) obviously takes the form p+ Rex (io) >0. For x«=2, 
when | xX:s| = const |x_;! the matrix condition (8.3) is transformed, as is easily 


verified, into 
prt+ Rex (io) >0, ps? + Rexss (io) >0 
2 (pz? + Re X11 (é)) (pS? + Re X02 (do)) > (8.7) 
> | Xr0 (fo) | | X91 (40) | + Re (X23 (f@)- X01 (é)). 


°* Here, we recall, ReZ = + @Z@+2Z°%). The left-hand side of inequality (8.3) is a = X % matrix. 
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For x = 2, when | %,3/== const|x.,{, we must have max yn, (o) < minn, (@) in 


addition to (8.7), where yn, <n, are roots of the equation 


M1! Las [* + 177 | Xa2 |* — 4 (wy? + Re xar) (wy! + Re Xs3) + 2 Re (xXis%01) = 0. 


We note that Theorem 8.1 is also applicable when some ofthe ''nonlinear 
units" are "linear nonstationary’, i.e., when 9, lo;, Qo) =, (to, for certain /, 
where a, (f are any functions satisfying (by virtue of (8.2)) the condition 
0< a, (1) << wy. 

For x = 1, Theorem 8.1 was proved independently by Popov /26/ (ina 
similar formulation, but without the estimate (8.4)) and by one of the 
authors /27/. The above proof resembles that of /27/. 

Formulation of additional results uses certain properties of hysteresis 
functions. 

Definitions: 1) A hysteresis function* @ (o,9,|, is called finitely- 
bounded, if for any +> 0 there exists 7, >0such that the |o (Nic 1 implies 
Ie lo, le|< 7, for any t and o (t). 

2) A hysteresis function is said to satisfy the positiveness condition if 
for any function o () and any 9, & E lo,) there exists a constant 6>0, such 
that 


ft 
| Plo.geled (t)>—86 for ¢>0., 
° 


and the strong positiveness condition if there exists a nondecreasing function 
& (oc), defined in (0, oo), such that & (0) = 0 and for any a (fh and 9, & E lo,) we 
have 


t 
J e16, Gedo (x) > — 8 [0(0)}. 


3) A hysteresis function is called continuous in the limit, if the relations 
o() ~0,. @ (0, @¢ ~@, fOr t ~ oo imply that 9, GE lo,) and @ \0,.9, )=9,. 

We illustrate these definitions with the following examples. @ {o,q]- = 
=a (fo*, for ja (f)}|< const, is a finitely bounded function satisfying the 
positiveness condition if @(9 >0 and da (N/d | 0. It is not continuous in 
the limit if @ (4) = const. 

The backlash is continuous in the limit. The functions in Figures 3a 
and 4a satisfy the positiveness condition, while those in Figures 3b and 4b 
do not. In the latter case, however, the functions po —@ [o,¢@,},; do satisfy 
the condition. 

Condition 2) means, in descriptive, imprecise terms, that the area 
increment along any hysteresis loop is nonnegative for the function @ lo, @)-. 

The backlash, a hysteresis function important in applications, does not 
satisfy condition (8.2). When condition (8.2) is not satisfied, the system 
(8.1) may have a nonunique equilibrium state. Stationary solutions of the 
system (8.1) exist for det P #0 and det x (0) #0, as is easily verified in the 
following manner. 


° We are considering a scalar function. 
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Let o =o, and @ = 49, be any stationary solutions of the equations 
0,+2(0)9,=0, Gi3,, Pole =9,. (8.8) 


(If x = 1 and 9 lo, q)); is the backlash, then (,,9,) are the intersection points 
of the backlash graph with the characteristic line o + x (0g = 0 — see Figure 3). 
The set of stationary solutions of (8.1) is the set of all c= —P™" gp, where 
gy, runs through all the values obtained as above. 

Theorem 8.2. Suppose that the conditions of Theorem 8.1 are 
satisfied, with condition* (8.2) replaced by 


ds, dq,[s¢,, ,] ds,\> 
Oc St Er Gor cy (=), jimi ce ae (8.9) 


Then as t — co the solution x (t)and the value of the vector hysteresis 
function q {o, ¢,) tend to certain limits 


X(t) > Xe 9, Folt > ,,: (8.10) 


This convergence ts exponential, and moreover there exist constants +,, 
2>0 depending on the linear part of the system (but not on 9 lo, @), oY on 
the initial values x (0),9,), such that 


| x(t) — xen & (7112 (0)] + Y1 Gol} em", (8.11) 
1915, Go), — 9 |< [19] x (0) |] + 11 Gof) em 


If 9 lo), Pls ave Continuous in the limit, then the limits are stationary 
solutions of system (8.1). 

Theorem 8.3. Under the assumptions of Theorems 8.1 and 8.2, if all 
the roots of the equation det (P —A/J) = 0 lie in the half-plane ReA< ~a <0, 
the rank of each of the matrices (6.2) is+,and for some diagonal matrix 
with positive diagonal elements we have 


Tygt + Relt,x(—a+ dw)} >0 for o>0, (8.12) 


then in (8.4) and (8.11) we may take e=a. 

The proofs of Theorems 8.2 and 8.3 coincide word for word with those of 
Theorems 3 and 4of /28/, where the case x = |] isconsidered. To prove 
Theorem 8.2, one must differentiate system (8.1) with respect to time and 
apply Theorem 8.1. From the estimate (8.4) one easily derives (8.11). 

We note that when @¢ [o,@,), is the backlash, condition (8.9) is satisfied 
with p, = 1. 

We formulate the following theorem /27, 28/, for simplicity, for x = 1. 

Theorem 8.4. Supposex=1, 


0 € 3 (1) (6. Pole 09 (1)*, (eK 00) 


° The case when instead of (8.9) we have pla? < ¢,<njeF may be reduced to the case under consideration 
by the substitution gi = ¢, —p{"e,. 
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and ys, = oo; then 9 lo,q) is a finitely bounded function. * 

If elo, gh satisfies the positiveness condition and (4.1) holds for some 
&>0 or (for p+ ~) if po —@ lo, g) satisfies the positiveness condition and 
(4.1) holds for some ®<0, then 


x()-+0 for t+, 8.13) 


where x(t)is any solution of system (8.1). 

If the positiveness condition is replaced by the strong positiveness 
condition in the above condition, system (8.1) is absolutely stable, and for 
Hy # 00 if is uniformly absolutely stable. ** 

This theorem shows that Popov's condition remains valid in the case of 
a hysteresis nonlinearity, if the parameter @ has a definite sign depending 
on the direction in which the hysteresis loop is circulated. 


§ 9. Differentiable nonlinearities strengthening of Popov's 
frequency condition 


The results of § 5 imply that Popov's criterion cannot be strengthened 
by using Lyapunov functions of class L. Consider (for x = 1) the class L, 
of functions 


V = x°Hx + 2x"hp + 10° + 0 | qde, (9.1) 
e 


where H is a vX v Symmetric matrix, 4 is v x | vector, 7 and ®are 
numbers. We shall assume that @ (a) is differentiable and that in addition 
to (0.3) we have 


—2< g (6) a, (9.2) 


where a, o#= oo,a, + oo and without loss of generality a, > 0,a, > py. 

The problem of existence conditions for a Lyapunov function of class L, 
for system (1.2) can be reduced, using the S-procedure, to certain new 
matrix inequalities /9/, similar to those of § 3; their solution yields the 
following result /32, 33/. 

Theorem 9.1. Consider system (1.2) for x =1 in the principal case. 
If there exist +, >0and «,>0 such that 


+00 >%s lien ar [J (edo — sO |>0, (9.3) 
and such that for all o>0 
m, (©) =a; (5! + Rex (io)) + 0 Re fiwy (io)) + 
+ tq? (1 + (43 — ao) Re x (4) — aay | x (io) [7] > 0, 
then system (1.2) ts asymptotically stable in the large. 


(9.4) 


* For pe< oo this condition is satisfied automatically. If p= limm(w)>0, where n(w) is defined by 
oe--o 


formula (4.1), the assumption of boundedness may be omitted. 
°° For the definition of absolute stability and uniform absolute stability see § 1. 
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An analogous statement is valid for the case when one of the numbers 
@,,a,, and p, iS coo* Then x, () has the form 


My (©) = ti [Ho' + Re x (iw)) +6 Re [iwyx(iw)] + eorgw* (Re x (/w) + a| x(io) [*), 


wherea = —a,,2,=1, if a, = o,a =a,,e = —1, if a, = oo. 

In this case the condition +~o> lim wtx, (o) > 0 must be satisfied in addition 
to (9.4). 

Under certain additional assumptions, an analogous statement is valid, 
with the sign > in (9.4) replaced by > and the conditions +t, > 0, 1 >0 by the 
conditions +t, >0, 1 >0 (see /33/). 

Condition (9.3) restricts the behavior of the function (0) only at infinity; 
it is satisfied if, for example, there exist finite or infinite limits lim @ (a)/o. 

o-- fo =] 


Popov's condition is obtained from (9.4) by putting t,=0. The problem 
of existence of the parameters 1,,1,, 6 has a simple geometrical solution. 
We denote 


G (@) = w* [1 + (4 — a) Rex (io) — ayag| x (iw) |*), 


(@) = Fi [met + Rex(iw)}, nw) = -75 Reliox (io)] (9.5) 


and construct in the {§, n} plane the curves** 


ri (E=E(o), n=n(o), (0) >O}, 
r_{—=€&(@), n=N(o), C(w) <O}. 


We assume for simplicity that there are no values o = o, for which 
simultaneously & (o,) = 7 @,) = $(@,) = 0. Any straight line in the (E, n} plane 
which intersects the half-line yn = 0, 

§ <0 will be called an admissible line. 
The open half-plane bounded by an 
admissible line and containing the half- 
line »n=0, &>0 will be calleda 
positive half-plane; the complementary 
open half-plane will be called a negative 
half-plane. We shall say that an 
admissible line separates the curves 

lr, and [_if the whole curve I, lies in 
the positive, and the curve Lr_in the 
negative half-plane. The frequency 
condition (9.4) is satisfied (i.e., the 
required values of the parameters 1, 
t,, and @ exist) if there exists an admis- 
sible line separating the curves I, and 
r_ (Figure 5, where the heavy line is 
admissible). Popov's frequency condition 
means, in terms of the curves I, and r_ 


W=W9°D 


* when pe = oo, obviously also a, = oo. 
** The values of @ satisfy the inequality (©) >0, on the curve I, and the Inequality ¢(@) <0 on the curve [ . 
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that there exists an admissible line which separates the curves I, andTr 
and passes through the origin. Figure 5 illustrates a case in whicha 
separating line can be drawn, but not through the origin of the coordinates. 
For such cases Popov's frequency condition is not satisfied, while absolute 
stability may be established by the criterion formulated above. 

A condition analogous to (9.4) can be obtained for the casex > 1. 

In /32/ it is shown that fora, = oor a,= oo, the above frequency condition 
includes all conditions obtainable by using Lyapunov functions of class L,. 
For finite a, and a,the problem remains open; it reduces to justification of 
the S-procedure for the given case. 


§ 10. Systems with external action. Dissipative systems. 
Existence of periodic motions. 


Consider the system 
dx 
a =flt, x) (10.1) 


where / (tf, x) is a continuous function in the (v + 1)-dimensional space {f, x}. 
The system (10.1) is called, as is well-known, dissipative, if: 

(I) There exists a bounded closed set {x} inthe v-dimensional space 
which is invariant for solutions of system (10.1), i.e., a set such that 
x (tf) & F implies x(?¢t)e&F for t>4; 

(II) for any solution x (f)there exists f such that x (4) & F. 

It can be shown that every dissipative system has a solution x () © Ffor 
—oo<t{< +00, The boundedness of the set F implies the boundedness of 
the solution x ( in (—oo, +00). Thus, dissipative systems are systems 
possessing, in some weak sense, limit motions, of which, as simple 
examples show, there may be several. 

The following general proposition of the Lyapunov type, implicitly 
contained in the work of N.N. Luzin /50/ and B.P. Demidovich /51, 52/, 
is similar to certain results of Yoshizawa /39/. 

Lemma 2. Suppose there exists a differentiable function V (x) with the 
properties: 

(I) V (x) — 00 for |x| — oo; 

(II) outside some sphere \x|> 


Vas F(t, )<—a(x) <0, 


where a(x) is a continuous function. Then the system (10.1) ts dissipative, 
and for the set F one may take the set F, of all x for which V (x) < n, provided 
that » >0Ois such that the set F, contains the sphere |x|< . 

Consider system (1.1) in the principle case and, for simplicity, with 
x=]. Application of Lemma 2, the function (2.11) playing the role of V (x) 
yields (using the theorems of § 3) the following result /31/. 

Theorem 10.1. Assume that (I) in system (1.1) the function f (h is 
bounded in (0, co); (ID for some ® conditions (4.1) and (4.2), where 
0< p< +0, ave satisfied; (Ill) the function 9 (o)in (1.1) satisfies the relation 


jim FO (1-5, 22] >0. (10.2) 
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na, if $+0, limx(@)=0, then also 


Then system (1.1) ts dissipative. 

Condition (10.2) is obviously considerably more general than condition 
(0.4). 

Theorem 10.2. Assume that in system (1.1) the function f ( is 
T-periodic and that the conditions of Theorem 10.1, with (10.2) replaced 
by (0.4), are satisfied. Then system (1.1) has a T-periodic solution. 

The proof is based on the application of Brouwer's principle to the set 
F,, which can be shown in this case to be homeomorphic to a sphere. 

Theorem 10.3. Assume that in (1.1) the function f (t) is bounded in 
(0, oc), and »(o) is differentiable and satisfies conditions (0.4) and (9.2). If 
for some x,>0,%,>0, and @ (9.4) is satisfied, then system (1.1) is 
dissipative. 

This is proved by applying Lemma 2 with the function (9.1) and using 
Theorem 4 /33/, which is similar to Theorem 3.1 and concerns a matrix 
inequality. 

Analogous statements are valid for x>1. 


§ 11. Absolutely stable periodic and almost-periodic 
limit motions 


We first consider stability in the large of some fixed solution x (f of 
system (1.1) with x > 1 and continuous functions gq, (o,). Putting y= x( — 2 (4, 
where x (Mis any other solution, we have from (1.1) 


b= Pytqni(sn t), o=r'y, (11.1) 


where 9, (6, =@ fo () + 0,) —@ lo™ ()}, 0 (1) = rx? (f). 
Let 


(8) — 13)? (91 
¢, — of ()) 


0< <p, fol... (11.2) 


for all o,, —o<o,< +oo and ¢>f,, where f, = —oo is possible. Applying 


Theorem 8.1 to system (11.1) we see that when he frequency condition (8.3) 
is satisfied, we have for all [>> 4, 


|x (4) — 2° (8) | & yee 4) | x (fo) — 2° (to) |, (11.3) 


where ¥ >0 and e>0 are constants determined only by the parameters of 
the linear part of the system and the numbers yp,. 
Condition (11.2) is automatically satisfied if for all o! and of we have* 


@, (32) — @, (of?) 


of!) — atv) Sep fl... %. (11.4) 


0< 


* In this case, obviously, we mav take (, = —eo. 
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In the frequently encountered case when the graph of 9g, (c,;) consists of 
straight line segments with (without loss of generality) nonnegative slopes, 
the number yp, is the maximum slope. If the function 9g; (,) is differentiable, 
condition (11.4) is equivalent to 


0 < do, (s))/do;Q py, j=l,..., (11.5) 


In accordance with the customary terminology, we shall speak of exponential 
convergence for system (1.1) if for any two solutions x (f) and z(f) of system 
(1.1) and any t >4% the estimate (11.3) holds for fixed numbers 7>0, e>0. 
The above considerations yield: 

Theorem 11.1. Suppose that (11.4) and the frequency condition (8.3) 
are satisfied, where +, is any diagonal x x x matrix with positive diagonal 
elements and 4is a diagonal xxx matrix with diagonal elements j,...,pn- 
Then system (1.1) has the exponential convergence property. * 

For x= 1a similar theorem is proved by Tsypkin and Naumov /16/. We 
note that under the conditions of Theorem 11.1 the solutions may not be 
bounded in (0, oo) (if, for example, the function f (f is not bounded in (0, oo)). 

Using Theorem 10.1 and additional considerations based on the estimate 
(11.3), the following result can be obtained (the proof is given in /31/ for 
x= 1): 

Theorem 11.2. Suppose that the conditions of Theorem 11.1 are 
satisfied and }f (t)is bounded in (—co, +00). Then system (1.1) has a unique 
solution x (ft) bounded in (—co, +00) (limit motion). This solution is exponen- 
tially stable in the large, i.e., for any solution x (Nand any t>t,, (11.3) 
holds, where y>0, e>0 are determined only by the parameters of the 
linear part of the system and the numbers y,,...,p.. Uf f (f is an almost- 
periodic or T-pevriodic function vector, the solution x ( will also be, 
respectively, almost-periodic or T-periodic. 

We recall that for x = 2 the existence conditions for the diagonal matrix 
ty figuring in Theorems 11.1 and 11.2 are givenin§ 8. For x=1, *Tsypkin 
and Naumov /16/ have given methods for checking the frequency condition 
(8.3) which are convenient for engineering calculations, as well as methods 
for synthesis of stabilizing devices which guarantee absolute stability of the 
limit motion. 


§ 12. Nonlinear sampled-data systems 


A large class of nonlinear pulse systems is described, as shown in /53/, 
by difference equations which, in the previous notation, have the form 


Keer = Pxe +99 (51), Op = r°xe. (12.1) 


We restrict ourselves to the case of a single nonlinearity, x =1. The 
transfer function from the input @g to the output (—s) will be x (e4), where x (A) 
is determined as before by (1.5). 

We shall say that the principle case obtains if all the roots of equation 
(1.3) lie inside the unit circle, and the simple critical case if one of the 
roots of this equation is 4=1 and the remaining roots lie inside the unit 
circle. 


* Our proof fs that of /31/, where the case *% = J is considered. 
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Tsypkin /14/ obtained the following results. * 

Theorem 12.1. Suppose that the principal case obtains and that the 
function 9 (s) satisfies (0.4). 

(D If forall o,0<a< 2n 


Ho’ + Rex (ele) >0, (12.2) 


then system (12.1) is exponentially absolutely stable: there exist constants 
e>0,7>0 depending only on P, 4,17, such that for any solution x (f 


Ho’ + Rex (e) >0. (12.3) 


(I) Uf the rank of each of the matrices (6.2) is v and (12.2) holds with the 
sign > replaced by >, then the solution x =0 of system (12.1) its asympto- 
tically stable in the large. 

(II) Part (l) remains valid if in (12.1) 9 (a) is an arbitrary function of 
x, and t (and, possibly, of certain parameters), related to o, by (0.4). 

Kalman and Szeg6 /36/ have shown that when (II) is satisfied, system 
(12.1) has a Lyapunov function of type (2.2) such that V1<Vr. Using a 
Lyapunov function of class L (see (2.11)), Szeg6 obtained /37/ a frequency 
condition more general in form than (12.3): 

Theorem 12.2 (Szegé). Consider system (12.1) in the principal case, 
and suppose that each of the matrices (6.2) has rank v, q (0) is a differentiable 
function satisfying (0.4), and0 <q’ (0)<a. If for some real numbers p and 
vy and all wo € (0, 2x) 


B+7>0, (B+ 1) Hs! —2Br'g — 2a8 (r°g)*signB > 0, 


(12.4) 
(B+ 4) pa! + Re l(r + Bet) x (et*)) —21PI | (er — 1) x (ia) |? > 0, 


then the solution x, = 0 of system (12.1) is asymptotically stable in the large. 


Under the assumptions of Theorem 12.2, Dmitriev has obtained certain 
other, less restrictive, conditions for stability in the large. His result is 
based on the use of a Lyapunov function of type (9.1). 


§ 13. Survey of the basic frequency conditions 


Consider systems (1.1) and (1.2) in the principle case, for x=1. The 
following three frequency conditions were basic in almost all the theorems** 
of §$§ 6—7: 


* In /14/, instead of the estimate (12.3) (see below) only asymptotic stability {s affirmed, but the theorem 
is proved for systems more general {n form than (12.1). The present formulation {s due to Yu. A. Dmitriev. 
°° Note that we have not formulated the additional conditions, as a rule only slightly rescrictive, which must 
be satisfied to ensure the validity of the following statements (see the corresponding theorems of $$ 6—12). 
In many cases the signs > in (13.1)—(13.3) may be replaced by >, and © may be +0. Condition (13.3) 
coincides with (9.4) for t= r,/ts. 
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po' + Rex (io) >0, (13.1) 
[Ho' + Rex (io)}] + 6 Re [iwy (ia)| > 0, (13.2) 
[vo + Rex (ia)) + 6 Re [ix (iw)) + 
+ tw {1 + (as —ay) Re x (ito) — ay09| x (i@) 9) > 0. (13.3) 


Inequality (13.2) must be satisfied for some ®, and inequality (13.3) for 
some 6 and +>0. In §§ 4 and 9 geometrical constructions for determining 
the existence of the parameters 6 and + were described. 

Condition (13.1) is the most restrictive. In this case the linear part of 
the system has high stabilizing properties: the closed system (1.2) will be 
exponentially stable in the large for a wide class of nonlinearities; any 
function of x and ¢ (in particular, a hysteresis function) which is related to 
the input @~ = 49 (¢) by the relation 0 < 09 < p,0? may be the input o =o (f of the 
nonlinear block. If the nonlinearity is of the backlash type and the condition 
0 <op <p,0? is satisfied (for the backlash p, = 1), then any solution x (A of 
the system (1.2) tends when ¢ — oo to some stationary state. If the system 
is subjected to a T-periodic, almost periodic or bounded in time external 
action, then the system has an asymptotically (exponentially) stable-in-the- 
large T-periodic, almost-periodic, or simply bounded limit motion, 
respectively (§ 11). 

A linear unit for which Popov's less restrictive frequency condition 
(13.2) is satisfied, possesses less stabilizing properties. The closed 
system (1.2) is absolutely stable in the class of any continuous or discon- 
tinuous nonlinearities @ = 9 (c) satisfying condition (0.4). If the nonlinear 
unit is nonstationary (» = 9 (c, 4) or hysteresis (9 =@ lo, q)), then absolute 
stability is maintained only under certain additional conditions (monotone 
variation of the function » (0, f) for fixed ¢, definite circulation of the 
hysteresis loop of yg [o,@,); —see § 8). A system with 7-periodic, almost- 
periodic or bounded external load may not have a single limit motion; it 
remains dissipative. 

A linear unit satisfying the least restrictive frequency condition (13.3) 
also possesses the weakest stabilizing properties. A closed system is 
absolutely stable for differentiable nonlinearities = 9 () satisfying, in 
addition to condition (0.4), also condition (9.2). A system with bounded (in 
particular, periodic or almost-periodic) external action remains dissipative. 

When conditions (13.1), or (13.3) are satisfied, system (1.2) has a 
Lyapunov function of the respective forms 


V=x°Hx, 
Vax He+ 8 (9 (2)do, 


V = xHx + 2x*hp (0) + 19 (0)* + 4 9 (0)do. 
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N.A. Kil'chevskii 


CONTINUUM PROBLEMS OF ANALYTICAL MECHANICS 


The aim of the present paper is to analyze research in analytical 
mechanics pertaining to the field which unifies the mechanics of discrete 
and continuous systems, and to formulate some problems of analytical 
mechanics relating essentially to continuum mechanics. 

We recall that the mechanics of discrete and continuous systems are 
unified mainly by the physical representation of the field of mechanico- 
geometrical objects and by the mathematical description of physical 
phenomena occurring in the field. These descriptions generally employ 
non-Riemannian geometry, and, in particular, variational principles, which 
are similar in form in the mechanics of discontinuous and continuous 
systems. 

We emphasize that it is now almost impossible to separate physical 
concepts from their mathematical description. 

The bibliography includes sources which have contributed to the formation 
of the ideas of the present report, irrespective of the references in the text 
of the paper. 

1. The basis for introducing the geometrical representations connected 
with the equations of motion of bound systems is provided by generalized 
coordinates and quasi-coordinates. 

It is well-known that the properties of a space are determined by the 
method whereby its points are arithmetized by selected coordinates, metric 
and affine-connection coefficients, which establish the equality of tensor 
quantities in neighboring points of the space. These three elements used 
in describing the properties of a space are in general independent. The 
metric of the configuration space or, in general, of a continuous medium 
is usually determined on the basis of the expression of the kinetic energy 
of an element of the medium. 

Let us recall this relation. Consider an element of a continuous medium. 
We assume temporarily that it consists of NM particles. The kinetic energy 
of this system, even in the presence of nonstationary relations, may be 
represented in the form 


N 
2T = % afxinxin, (1, R= 1, 2, 3, 4), (1.1) 
=) 


where #f, are the generalized velocities of the j-th particle, and a? the 
corresponding coefficients of mobility. 

It is assumed that the coordinate xi, is a linear function of the time ¢. 
Putting 


N 
xty = x8 + Ely: Ely = 0: 2 eet, = 0, (1.2) 
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where p/ is the mass of the j-th particle, we obtain from (1.1) 
ese N N of 2 ye v (Dea 
OT = x x p2 aj? + 2 as, Buy Btn + 2Xe Py ik Ein- (1 3) 
=) ad | = 


Passing again to a continuous medium, we assume that a” and p!) are 
differentiable functions of the coordinates x. We put: 


&,, = lim af: p. (1.4) 
ef )-00 
Then 
#2) ) é afl) ry 
Qu. =P & an (sir) +e], (1.5) 
¢ 
We also put 
qt 1c Nae Re . 
av OY =P AWD) Een =— (1.6) 


jal {/=1 


where AY is the volume element. 
The kinetic energy 2T then assumes the form: 


QT = gun (pxexe — VT) AV + O(EW), | (1.7) 


where O (&/;) ~U when &{) — 0. 


Contracting the medium element into a point, while keeping the right-hand 
sides of (1.6) finite, we find 


2K = gin (px'xt — vi), (1.8) 


where KX is the kinetic energy density of the element of the continuous 
medium. The tensor g,. is usually regarded asthe metric tensor. The 
expressions in parentheses in the right-hand side of (1.8) are one of the 
forms of the components of the energy-momentum tensor, or, following 
Levi-Civita, of the kinetic stress tensor. Consequently, an intrinsic 
relation exists between the metric of the space and the energy-momentum 
tensor. This is considered in more detail below. 

The coefficients of affine connection do not follow from consideration of 
the kinetic energy. To determine them we must use the differential 
equations of motion. We mention, for example, the equations /10/: 


Af 4. Photo’ = X%, (a,b, cH E41, 6 42,..., M, N41), (1.9a) 
Tév'o’ = X°, (a= 1, 2,...,8 6 cHl+....,.N +41), (1.9b) 
vu? = Big! (a, J=1, 2,....M, N+), (1.9c) 
hq ‘g* = Q', (J =N+42,..., 3a). (1.94) 

X* =peQ!, Q! =alX,, (a, f=}, 2,...,N 41). (1.9e) 
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Equations (1.9a) and (1.9c), due to I. Schouten, determine the law of 
motion of a system with geometric and nonholonomic connections. The 
remaining equations express the reactions of the connections. It is 
important to note that the coefficients I's. of affine connection, generalizing 
the Christoffel symbols of the second kind, are determined from the 
equation of motion with an accuracy to within a certain term antisymmetric 
in the indexes 6 and c. These coefficients comprise one of the fields of 
mechanico-geometrical objects mentioned above. The facts we have 
recalled are the basis of extensive research extending the possibilities of 
geometrical interpretation of the motions of discrete and continuous 
mechanical systems. In particular, this has led to the creation of modern 
nonholonomic geometry, in the well-known monograph of I.A. Schouten and 
D.G. Struik /27/, in numerous papers of G. Vranceanu, of whose recent 
papers we mention /42/, Z. Horak, A. Wundheiler and others. These 
investigations are concerned essentially with a branch of non-Euclidian 
geometry, and are beyond the scope of the present review. 

One of the founders of the modern geometry of nonholonomic systems 
is V.V. Vagner. His works have contributed, on one hand, to the develop- 
ment of a geometrical theory of nonholonomic manifolds, and, on the other 
hand, have indicated the possible applications of this theory to mechanical 
problems, in particular, of the dynamics of a rigid body /2—4/. Vagner 
considers the configuration space of a rigid body with a fixed point, moving 
by inertia (Euler space), and proves that this space is Riemannian. Then, 
imposing a nonholonomic constraint on the body, he sets up the equations of 
motion of the representative point in a nonholonomic space imbedded in the 
Euler space; finally, using the geometrical properties of the nonholonomic 
space, he obtains the solution of the problem in terms of hypergeometrical 
functions of time /3/. Nevertheless, these papers do not provide a general 
technique for effective application of the geometrization of dynamic problems 
of nonholonomic systems to the solution of concrete problems. 

J. Synge writes in the monograph /26/ that, in principle, geometrical 
methods, in particular tensor analysis methods, cannot be regarded as 
a means for effective solution of special problems, and can be used only 
for describing the most general laws of motion of mechanical systems. 
Later a number of papers, of which we mention that of V.V Dobronravov 
/6/, weakened Synge's affirmation by generalizing a number of fundamental 
theorems of the dynamics of holonomic systems to the dynamics of non- 
holonomic systems, these results being based on substantial utilization of 
the apparatus of tensor analysis. We mention that among these theorems 
are some which make it possible to find the integrals of differential 
equations of motion /6, 14/. We touch upon this problem again in the 
sequel. 

Both recent and previous investigations in the geometry of the dynamics 
of discrete systems link the geometrical investigations of G Vranceanu 
/42/ with the geometrical interpretation of motions of nonholonomic systems, 
due to E. Clauser /31/, M. Haimovici /34/, and J. Klein /36/. The last- 
mentioned paper differs from the others by its use of the metric of Finsler 
spaces to establish a generalized principle of least curvature. 

To summarize, it can be said that the greatest progress in this field is 
bound up with geometrical interpretations of the general laws of motion of 
material systems. Basically, these interpretations are intrinsically related 
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to new methods for solving complicated problems of mechanics, but 
development of effective methods for solving concrete problems on the 
basis of generalized geometrical representations is subject to further 
investigations. It is essential for our present purpose that geometrization 
of the laws of motion of discrete systems is one of the links between the 
mechanics of discrete and continuous systems. 

Problems of geometrical generalizations are also related to unified 
field theory.'' Among the numerous papers in this direction we mention 
several which develop non-Riemannian geometry, for example V. Hlavaty 
/35/, in which the determination of the coefficients of affine connection is 
studied, and of T. Takasu /40, 41/, which apply the methods of nonholono- 
mic geometry to the extended general theory of relativity. 

2. Approximate methods for solving problems of continuum mechanics 
lead to the formulation of new problems of the mechanics of systems with 
finitely many degrees of freedom. As an example we mention the investi- 
gations of I. V. Nikolenko on the statics of thin beams /18, 19/. These 
papers continue to develop the analogy between the motion ofa rigid body and 
the equilibrium modes of thin beams, pointed out by Kirchhoff. Nikolenko 
considers problems concerning the equilibrium of constrained thin beams 
with nonlinear constraints which are nonintegrable in particular cases. The 
author uses methods introduced by Vagner in problems of the motion of a 
rigid body, with appropriate changes. The problems considered by Nikolenko 
refer to a narrow class of problems of continuum mechanics and do not 
therefore yield generalizing conclusions unifying the mechanics of discrete 
and continuous systems. An outline of the possible extension of the methods 
of classical analytical mechanics to the mechanics of the continuum is given 
in the book of J. Leech /14/ and others. The generalized coordinates of 
the system are replaced by field variables, and the three (in the 
general case) spatial coordinates and time are regarded as independent 
variables. 

The Ostrogradskii-Hamilton principle is used to construct systems of 
differential equations of motion of the continuum elements. But Leech points 
out that there is no rigorous foundation for the application of this principle 
to problems of continuum mechanics (see /14/, pp. 119 and 120). 

Let us give some illustrative example. In the applied mechanics of 
deformable bodies, approximate representation of the required quantities 
by various expansions is customary. The coefficients of these expansions 
are special forms of field variables. Of course, if these coefficients depend 
on time, they may be called generalized coordinates. However, this term 
is also regarded in more general cases as the equivalent of field variables. 

Let us first recall some well-known facts from the theory of shells. One 
of the old methods for reducing three-dimensional problems of elasticity 
theory to two-dimensional problems is based on the expansion of the 
quantities determining the stress-strain state of an element of the shell in 
series of ascending powers of the distance of the element from the middle 
surface. 

The coefficients of these expansions, which are functions of the coordi- 
nates of the middle surface, must be regarded as field variables. By using 
the equations of motion of an element of a continuous medium, the boundary- 
value problem of the theory of shells can be reduced to the determination of 
six field variables /9/. In the absence of body forces the kinematic boundary 
conditions generate the equations of the constraints. It is clear that these 
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constraints are not geometrical or integrable. In the presence of body 
forces we do not obtain equations of constraints in the classical sense at all. 
These facts confirm the expediency of investigations aimed at justifying the 
use of the Ostrogradskii-Hamilton principle in continuum mechanics, in 
particular in the theory of shells. 

Let us consider a problem of analytical mechanics arising in connection 
with the approximate solution of a hydroelasticity problem, concerned with 
the motion of a shell carrying a nonviscous liquid with a free surface in 
a conservative force field. This example is only of heuristic value. 

Not solving the hydroelastic problem completely, let us try to determine 
the parameters which will enable us to find the position of the shell and the 
hydrodynamic pressure without investigating the field of the relative 
velocities of the liquid elements. First let us consider the general formula- 
tion of an approximate solution. 

With the nondeformed shell we associate a moving Cartesian coordinate 
system 0%,&f,. The required quantities are approximated by polynomials 
of the form 


~ 
l 
$ 
> 
M4 


_ denrbr BES. (2.1) 


The coefficients a aa, are unknown functions of time, to be determined. 
Their mechanical meaning is characterized by the physical quantity 
represented by (2.1). They can be regarded as field variables or, in the 
more customary terminology, as generalized coordinates of the system, 

if the displacements of the points of the shell and the hydrodynamic pressure 
are approximately represented by the form (2.1), or as generalized 
velocities, if the components of the relative velocity vector of the liquid 
element are approximately represented thereby. The kinematic conditions 
imposed on field variables in this case follow from the condition of incom- 
pressibility, which is approximately replaced by the condition of constant 
liquid volume 


V=V, (2.2) 


from the condition at the wet surface of the shell 


Un = (#). (2.3) 


(where v, is the relative velocity vector of the liquid element and u is the 
displacement vector of the shell element) and from the condition 


pit, bi. Es. Es) — pe = 0, (2.4) 
at the free surface S (where p is the hydrodynamic pressure and p, the 
pressure at the free surface). 

Condition (2.4) implies 


SP +-v,-gradp = 0 (at S). (2.5) 


71 


KIL’CHEVSKII 


Relation (2.5) and the approximate representations of the required functions 
in the form (2.1) lead to the equation of a nonintegrable constraint. In 
particular cases when a moving coordinate system can be chosen (the 
possibility of such a choice is ascertained by visual observations of the 
motion of the pendulum carrying the liquid), such that the product v,-grad p 
is sufficiently small, we find from condition (2.5) 


op 
(3), = 0 (2.6a) 
or, approximately 


D  Papybrbats = 0. (2.6b) 
a, 8, v=o 


Here and below the dot denotes differentiation with respect to time. We 
have again obtained the equation of a nonintegrable constraint, but simpler 
than that following from (2.5). 

The presence of a nonholonomic constraint (2.6b) precludes use of the 
Ostrogradskii-Hamilton and Euler-Lagrange principles for an approximate 
solution of the problem in question, unless there exists an integrating factor 
for the left-hand side of (2.6b). 

It seems that the applicability conditions of these principles to hydro- 
elasticity problems are worthy of further study, since the approximate 
method described here is nevertheless intrinsically related to the precise 
methods for solving these problems. 

If the integral characteristics of the motion of the 'shell-liquid'' system, 
such as the frequency of natural oscillations of the system, are to be 
determined, condition (2.6b) may be averaged over the region in which the 
free surface is moving, if the conditions of the problem provide an 
approximate delineation of this region. 

In this case, after averaging, condition (2.6b) yields the equations of an 
integrable constraint. The physical meaning of this approach is fairly clear. 

Instead of the motion of a body with a cavity partially filled with liquid 
having a mobile free surface, let us consider the motion of a body carrying 
a liquid with a quasi-fixed free surface. It is clear that such an approximate 
approach reduces the number of degrees of freedom of the system with 
respect to the generalized coordinates and distorts the spectrum of free 
oscillation frequencies. Since this simplified solution of the problem may 
nevertheless be useful, we shall consider it briefly. 

Since in the simplified formulation the constraints are stationary and 
integrable, the Euler-Lagrange principle may be used instead of the more 
customary Ostrogradskii-Hamilton principle. This makes it possible to 
use the energy integral to determine the kinetic energy of the liquid. We 
find 

¢ 
7). TO _ 7 —{ (§ (p — Ps) verdt] do, (2.7) 


0) 0 


where TW is the kinetic energy of the liquid, 7¢’ is the initial kinetic energy 
of the liquid, J is the potential energy of the liquid, o is the wet surface 
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of the shell, v, is the absolute velocity of the liquid element. Equation (2.7) 
enables us to exclude the field of relative velocities of the liquid elements 
from our considerations. 

The kinetic energy of the ''shell-liquid'' system can be represented in the 
form 


¢t 
T=T1(q» 9) + (734, Gay 1) be (P= 2 oe NE PHN 2 ee De (2.8) 


where 7, is a quadratic form in the generalized velocities and 7, depends 
linearly on the generalized velocities. The generalized coordinates 4q,,..., qm 
determine the motion of the coordinate system 0,§,f; and the relative 
displacements of the shell elements. The independent generalized coordinates 
neue ++) vet are equal to the coefficients aaesy in the approximate 
expression for the hydrodynamic pressure. 

Eliminating the time ¢ as an independent variable on the basis of the 
energy integral, choosing a new independent variable gq, and putting 


q 
t= ( Ts (9,1 Guay q,) dq, (2.9) 
qs 


where the prime denotes differentiation with respect to 9,, we obtain a 
variational problem 


aVea—m)/ Tae q) dq, = 0, (2.10) 


where A denotes asynchronous variation, and & is the energy constant. 
Introducing the notation: 


FO» dep G=A—MY PUM, (2.11) 


we find from (2.10) 


QF aT, _ d aP\ AF (aT _ dT 
dai (34, + BE a9 ine) ~ Hae — Bag?) = 0, (2.12a) 
BOE OF OTe 20 (Fe 1 Oe cc NN; Pedy Beda: (2.12b) 


dq; Oy, = Co 80n-es 


Equations (2.12a)—(2.12b) are generalizations of the well-known Jacobi 
equations. 

Equations (2.12a)—(2.12b) must be considered with additional natural 
initial conditions: 


oT =0, (i=32,..., N). (2.13) 


S leemen 
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The system (2.12a)—(2.12b) is of order (NV —1)+/. The number of 
initial conditions, including (2.13), is equal to the order of the system. 

These examples illustrate the trend of the possible extensions of the 
foundations of classical analytical mechanics originating in problems of 
continuum mechanics. 

3. We turn to the problems of analytical mechanics of the continuum 
which we believe to be the most important. We shall concentrate attention 
on two problems. 

First we shall consider the relation between the kinetic-stress functions 
which we introduced in 1953—1954, and a variational principle analogous 
to the Ostrogradskii-Hamilton principle. Then we shall discuss the relation 
between this principle and the basic law of continuum mechanics, which 
establishes the relationship between the kinetic-stress tensor and the 
metric of the deformed space, in particular, the strain tensor. Finally, 
we shall touch upon the general problem of integrating the equations of 
continuum mechanics using extensions of the theorems of classical analytical 
mechanics. 

We shall use the mathematical apparatus of the general theory of 
relativity. This is perfectly natural, since one of the phenomenological 
foundations of the general theory of relativity is classical continuum 
mechanics. We emphasize that the physical principles of the general 
theory of relativity will not be discussed. 

a) It is well-known that the system of equations of motion of an element 
of a continuous medium and the equation of continuity can be represented 
in the form of an invariant equation 


VT" =pF*, F=0, (i, k=1, 2, 3, 4), (3.1) 


where 7“ are the components of the kinetic-stress tensor figuring in 
relation (1.8). 

Consider the case when body forces are absent; let us temporarily 
refrain from postulating the Euclidean properties of the space. Equations 
(3.1) are then identically satisfied by the tensor 


pit = (R* —5 eR), (3.2) 


where R” are the components of the contracted curvature tensor, g’* are 
the contravariant components of the matric tensor, and x is an arbitrary 
constant /29/. In order to set up the tensor of kinetic-stress functions, 
which satisfies the system (3.1) of homogeneous equations identically in 
Euclidean space, we use a method of metric variation which consists in 
giving the components gu of the metric tensor increments e®,, where e is 
a small parameter. Putting x = e! and expanding the right-hand side of 
(3.2) in ascending powers of e, we find 


P* = Qit) + eQin +: °°. (3.3) 
It may be proved that the Qj, identically satisfy equations (3.1) in the 


Euclidean metric in the absence of body forces, and consequently form the 
tensor of kinetic-stress functions /9, 10/. 


74 


KIL'CHEVSKII 


According to theorems in Eddington /29/, the tensor P is the functional 
derivative of the scalar xR 


pir — 8(xR) | 
5810 (3.4) 


The kinetic-stress functions are obtained by letting e —0 in the functional 
derivatives (3.4). 
The functional generating the derivatives (3.4) has the form 


[=x \ RY gan, (3.5) 
(82) 


where 2 is a domain in the four-dimensional space-time manifold. 
Equation (3.4) is related to the condition 


81 =0, (3.6) 


which is formally similar to the Ostrogradskii-Hamilton principle. 

We recall that the main difference between this principle and the 
Ostrogradskii-Hamilton principle is that in determining 5/ the coordinates 
x (i= 1, 2,3, 4) are varied, whereas when applying the Ostrogradskii- 
Hamilton principle they remain constant. Equality (3.6) results from the 
invariance of R under point transformations of the coordinates /29/. 

We note that this method yields ten kinetic-stress functions, correspond- 
ing to the number of quantities determining the energy-momentum tensor. 
However, upon consideration of the geometrical meaning of these functions 
we may assert that the number of functions essential for the solution of 
problems is four, since, without loss of generality, four-dimensional space 
may be provided with an orthogonal coordinate system and a metric tensor 
having four essentially different components. These geometrical considera- 
tions lead directly to the conclusion that in problems of statics the number 
of essentially different stress functions is at most three. In two-dimen- 
Sional dynamical problems of the theory of elasticity, for example in 
problems of shell mechanics, the number of these functions is also three. 

The paper of I.S. Arzhanykh /1/ stimulated the development of the theory 
of kinetic-stress functions. Our kinetic-stress functions have been used 
by S. Kaliski (Poland) and V.N. Ionov in the solution of three-dimensional 
dynamical problems of the theory of elasticity. 

b) As already indicated, extension of methods of analytical mechanics 
to the mechanics of a continuous medium depends largely on selection of 
the field variables or thermomechanical parameters of state. For example, 
L.I. Sedov and M.E. Eglit regard as field variables the components of the 
finite strain tensor D,, those of the stress tensor o”, the temperature T 
and certain other quantities depending on the specific model of the continuous 
medium. 

When this is done, there arise analytical problems with nonholonomic 
relations between these parameters, which enable us to indicate new general 
mathematical descriptions of the processes taking place in various types of 
continuous media /24, 28/. Eglit has shown that this method can be used 
to describe, for example, the relaxation of stresses in some high polymers. 
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Other very general approaches to the choice of field variables can be found 
in the monograph of A.A. Il'yushin /7/ and in other literature on the 
mechanics of a continuous medium. A detailed analysis of these investiga- 
tions is beyond the scope of the present report, but we shall refer to them 
in discussing the problems of continuum mechanics which we have selected 
for further analysis. 

One of the basic problems of the mechanics of a rigid continuous medium 
is the functional relation between the kinetic-stress tensor of the energy- 
momentum tensor and the finite strain tensor. We note preliminarily the 
following. 

As is well-known, classical mechanics of a continuous medium compares 
two states of matter: the initial, undeformed state and the deformed state. 
These states are characterized by their respective metrics. The compo- 
nents of the strain tensor are determined by the differences of the 
corresponding components of the metric tensors, defined in Lagrangian 
coordinate systems, in the deformed and undeformed media. 

Due to the introduction of the strain tensor, its components must be 
included among the field variables instead of those of the metric tensor, 
although the latter is apparently more natural. The expediency of this 
replacement is confirmed, in our opinion, by the investigations of 
E. Kroner /12, 37/, J.D. Eshelby /30/ and others, in continuous 
dislocation theory. These authors were also forced to avoid unconditional 
postulation of the absence of internal curvature of the space connected with 
the deformable medium, and, consequently, to dispense with the St. Venant 
compatibility conditions. The author of the present report arrived at these 
conclusions in 1938 /8/. 

We shall consider two versions of the theory, corresponding to the 
introduction as field variables: a) of the components of the metric tensor, 
which vary in the deformation process; b) of the components of the finite 
strain tensor. The basis of the theory is a variational relation related to 
the d'Alembert-Lagrange principle. Following the statements of J. Leech 
/14/ and the research on the analytical apparatus of the general theory of 
relativity of A. Eddington /29/, T. de Donder /33/ and others, who 
considered a modification of the Ostrogradskii-Hamilton principle, we 
postulate this variational relation. In the first version the field 
variables are the components of the metric tensor g,, the components of the 
kinetic-stress tensor T", the components of the temperature gradient 27 
and the temperature 7. The tensor quantities enable us to construct a set 
of invariants, making use also of the physical tensor characteristics of the 
properties of the medium being studied. 

Some indeterminacy arises here: only in exceptional cases can it be 
eliminated by theoretical considerations; such is the case, for example, in 
the general theory of relatively. Otherwise we may obtain results which 
require experimental verification and may even be devoid of physical 
meaning. 

To set up the equation which expresses the d'Alembert-Lagrange 
principle, we consider a special case. Let us find the elementary work 
performed by the action of force and temperature when the metric is varied 
by deformation of a four-dimensional element of the medium. 
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Let us assume that the variations 6g, of the metric tensor are zero at 
the boundary of the four-dimensional region filled with the medium. We find 


sA= \ (8K —xTbg,, + A(T —T>) dg, + 
(62) 
+ BY: "O,.TAsTogn +-..)V gdQ=0, (3.7) 


where X is an invariant depending on the g, and their derivatives with 
respect to the coordinates x (§= 1, 2,3, 4), 7, is the initial temperature, x, 
A‘, §0/¢ are the physical characteristics of the thermomechanical 
properties of the material. 

With the aid of transformations indicated, for example, in /29/ the 
variational relation (3.7) can be given the following form: 


\ (P®— xT" + A(T —To) + BY O.TOT +...) 88, Vgd2Q=0. (3.8) 
(82) 


Hence we find 
xT = pP™ 4 A®(T—T,.) + BY “O.TOsT +... (3.9) 


The first term on the right-hand side of (3.9) represents the dependence 
of the kinetic-stress tensor on the varying intrinsic metric of the deformable 
space associated with the continuous medium, i.e., on its deformation; the 
remaining terms have a thermomechanical interpretation. It is clear that 
there always exists a tensor A“ corresponding to the addition of terms in 
Hooke's law depending on thermal expansion /17, 38, 39/. 

The term B*4. 49, TdgT allows for the flux of thermal energy, and also 
the increase in entropy /21/. 

According to well-known theorems the tensor P" satisfies the condition 
of energy-momentum conservation /29/. Postulating that the energy- 
momentum tensor obeys the conservation conditions, we define, with 
an accuracy to within, say, a scalar factor, the components of the 
generalized energy-momentum tensor 3* by the expres- 
sion 


So = xT" — A“T — BY "O.TOT +2... (3.10) 
From (3.9) we then find 
ol = pi, (3.11) 


Formula (3.9) or (3.11) expresses the required relation between the 
kinetic stress tensor and the metric tensor. It is simple to introduce in 
(3.9) and (3.11) the strain tensor 


Bin = On + ER (3.12) 


(where gj) are the initial values of the components of the metric tensor) and 
thus to arrive at the second version of describing the stress-strain state 
of a continuous medium. Introduction of the tensor Dy corresponds to the 


special assumption that the variation 6gii) vanish. 
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More general cases are considered in our paper /8/. 
If we put 


mes (3.13) 


in (3.7) (where R is the contracted Riemann-Christoffel tensor), the tensor 
P* assumes the form (3.4), and (3.11) reduces to the Lagrange equations of 
the second kind 


6R _ OR Oo OR _— ah 7 yp _ 
Sin ~ Fn 2 aetdg 2 =F RAND 3, 4, (3.14) 
where 
in, 0 = abn (3.15) 


We recall that equations (3.11) and (3.14) follow from the variational 
relation (3.8) for independent dg,. 

If the components 3 are zero, the variational relation (3.7) becomes a 
variational principle similar to the Ostrogradskii-Hamilton principle, 
encountered in the analytical apparatus of general relativity theory. 

By variation with respect to other variable and completing the integrand 
of (3.7), we obtain equations figuring, for example, in the monographs of 
L.I. Sedov /4/ and I.I. Gol'denblat /5/. In this case nonholonmic relations 
between the field variables considered by Sedov and Eglit /28/ can be 
introduced, with appropriate alterations of the equations known from 
analytical mechanics. These alterations and their geometrical interpreta- 
tion were mentioned above. 

Deformation of a continuous medium can also be regarded as the result 
of some nonholonomic transformation. One should not then assume that 
the quantities 6g!) vanish, but rather relate dg to dgi,) by the following 
relation, indicated in our paper /8/: 


dg) = Bistdg, .. (3.16) 


When this is done the analytical form of the conservation condition for 
the energy-momentum tensor >}* changes. 

Returning to (3.9) and (3.11), which yield the required relation between 
the kinetic stress tensor and the strain tensor, we make some brief 
remarks on the history of the problem. 

The relation between Einstein's gravitational equations and Hooke's law 
was indicated in 1937 by Yu. Rumer /23/. In 1938 we investigated the 
relation between a general law of the form (3.9), without the introduction 
of temperature terms, Hooke's law, the extension of Hooke's law to the 
theory of small elastic-plastic deformations and the relationship between 
the stress tensor and the strain velocity in the mechanics of viscous fluids 
/8/. Difficulties connected with the contradiction between postulation of 
the Euclidean properties of space and the need to introduce the Riemann- 
Christoffel tensor, as well as the lack of information concerning the physics 
of rigid bodies, forced us to interrupt our investigations, although the role 
of the initial stresses, and other factors taken into consideration later by 
E. Kroner and others, were mentioned in the above paper. 
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E. Kroner, R. de Witt and others have pointed out anew the importance 
of the Einstein tensor P/ for the mechanics of continuous media, and have 
given a physical interpretation of the internal curvature of a continuous 
deformable medium on the basis of dislocation theory /12, 30, 37/. 

4. The above problems indicate the diversity of the problems raised by 
the extension of the methods of classical analytical mechanics to continuum 
mechanics. 

Two mutually related problems are, in our opinion, particularly 
important: a) analysis of the conditions for applicability of classical 
variational principles in continuum mechanics; b) extension of the general 
methods for integration of the classical equations of analytical dynamics to 
the integration of the equations of continuum mechanics. 

The latter problem, which is of particular interest, is touched upon, 
for example, in Gol'denblat /5/, J. Leech /14/ and K.A. Lur'e /16/. 

When these problems are worked out, it will apparently be possible to 
indicate new, sound approaches to the solution of the basic problem of the 
mechanics of continuous media—the establishment of a general functional 
relation between the stress and strain tensors. 
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N.N. Krasovskii 


PROBLEMS OF CONTROLLABILITY, OBSERVABILITY, 
AND STABILIZABILITY OF DYNAMIC SYSTEMS 


1. The following three problems are of great importance in the theory 
of controllable dynamic systems, particularly in the theory of optimal 
processes. 

1. The control problem (CP), i.e., the problem of calculating the forces 
which transfer the controlled object from a given state to another. 

2. The observation problem (OP), i.e., the problem of the operation 
which determines the unknown coordinates of the object on the basis of 
observable quantities. 

3. The stabilization problem (SP), i.e., the problem of creating a 
regulating action which ensures stable realization of a given motion on the 
basis of accessible information. 

Each of these problems is often associated with demands for the best 
quality of the process in some sense or another; they then become part of 
the theory of optimal systems. However, CP, OP, and SP are also of 
independent interest; besides, even within the framework of optimality 
problems they display specific features, sometimes also difficulties. They 
are therefore worthy of special consideration. Well-defined methods for 
investigating these problems have now been developed and certain general 
regularities have been established. The purpose of this paper is to discuss 
some of these results. 

The SP, including the problem of optimal stabilization, can be considered 
as a further development of the classical theory of Lyapunov stability /1, 2/. 
The theory of stabilization is closely bound up with the investigations of 
N.G. Chetaev and his students V.V. Rumyantsev and G.K. Pozharitskii on 
the stability of systems in the presence of mechanical forces of different 
natures. CP and OP have been intensely studied recently, particularly in 
connection with the theory of optimal systems. However, they are also 
closely related to the SP, playing an important role in its investigation. 

CP have been considered in the theory of optimal processes, on the basis 

of the maximum principles of L.S. Pontryagin and his school /3/, from 

the viewpoint of R. Bellman's theory of dynamic programming /4/, and 
within the framework of classical variational calculus. R. Kalman /5/ 
studies CP and OP for linear systems under conditions of minimum quadratic 
criterion functions; he formulates the duality between CP and OP and 
considers the canonical representation of linear systems. A significant 

role in the study of CP, OP, and SP is played by the investigation of specific 
controlled mechanical systems, where important results, such as the 
stabilization theory of complex gyroscopic systems developed by A. Yu. 
Ishlinskii /6/, have been obtained. We point out the relation between OP 
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and problems of noise filtering and prediction /7/, which comes to light in 
the statistical development of the problem /8/, the relation between CP and 
the theory of invariant systems, and the relation between CP and OP and 
A.I. Lur'e's problem /9/ of absolute stability, in particular, with the aspect 
of this problem developed by V.M. Popov /10/. Many problems of control 
and observation figure in A.A. Fel'dbaum's theory of optimal systems /11/. 
A number of studies deal with numerical investigation and solution of applied 
CP and OP (see Ya.N. Roitenberg /12/). A satisfactory review of the huge 
literature on CP, OP and SP is beyond the scope of this paper. We have 
therefore mentioned only studies which define some basic direction of 
investigation and give some idea of its general character. For this reason 
we shall not refer to the numerous specific sources in the main text of the 
paper. 

One possible approach to Problems 1 and 2 (and particularly to linear 
CP and OP) istoconsider them as problems of constructing linear operators, 
Satisfying given conditions, on the elements of some functional space. With 
this approach one can evaluate the problems fromthe general standpoint of 
functional analysis, and use, for instance, the extension theorem for linear 
operators /13/ in its specific forms, such as the moment problem, the 
problem of separation of sets, and soon. From this point of view, which 
we, in particular, have adopted in Sverdlovsk, we consider these problems 
in the present paper. 

Problem 3, which develops stability theory, can be approached via the 
problem of synthesis of optimal systems, in particular the problem of 
analytical regulator design. A basic source of this point of view, which 
we adopt here, is the research of A.M. Letov /14/. These methods of SP 
investigation develop results of Lyapunov, Chetaev and others in the theory 
of stability of motion, in particular, the method of Lyapunov functions, 
modernized in accordance with the principles of dynamic programming. 

2. Let us consider the control problem. Suppose a dynamic system is 
described by ordinary differential equations 


dx 
S =flt. x al, (2.1) 


where x is the vector of phase coordinates x, (i= 1,..., 2) of the object (for 
example, n= 2s, x,,...,%, may be the generalized coordinates, and x,,,..., 
x, the generalized velocities, of a holonomic mechanical system) and ug; is 
the vector of control actions uw; (j= 1,..., 7). 

The coordinates uw, are either forces actually applied to the object, or 
quantities related in some way to the forces and introduced into (2.1) ina 
convenient form. Finally, the relation between the u, and the x, may be 
more general than (2.1). 

Problem 2.1. Given the initial state {f, x} of the object and the manifold 
of its final states («), x), described by parametric equations : 


x") = s(z}, (=e? + 6[2), (2.2) 
where z is a k-dimensional vector of parameters, restricted perhaps, by 


some condition which we denote symbolically by z@Z. There may be 
additional restrictions and constraints. 


83 


KRA SOVSKII 


The problem is to determine values 
z= 7 and the functions u,;() (@<t< 
<(°-+ 6) satisfying the given restrictions, 
such that for u=u (a motion x (é) 
starting at the point x (f°) = x reaches 
the point x (¢) = s [2]. 

The conditions determining the 
manifold M may preassign not only the 
final values of x), but also the values 
of x (#) at any other instant (including 
t=f), M being then determined by 
equations x (t) = 4 (z], HO =  4- 6 (2), 
where {i) is a finite or infinite index set. 
In this way we can, in particular, allow 
for restrictions on the motion coordinates 
x(9Q. The formulation is similar if there 
are infinitely many degrees of freedom. 

Example 2.1. Consider an s-link 
plane hinged mechanism (Figure 1) in 

a the gravitational field with masses 

m (i= 1,...,8), controlled by a horizontal 

force u applied at one of the points m,, or byamoment u appliedat the j-th 
link. Let &, 1, be the coordinates of m,;. The problem may be to determine 
an action u(t) which, during a given time 0<t<T, moves the system from 
the initial states § # to a state in which all the masses m, lie along the 
same straight line {7 = (4, +...+ 4) 2, t= 0 inside a given interval 
a<z,<f. The number T may be given; for example, T=A([z] =z, 
subject to the restriction 


P =(|u(s)|de <1 (2.3) 


on the impulse P of the force u(. If in the latter case it is required to find 
z {z,, z,} and u (for which the problem is solvable with the smallest z,=T, 
we obtain an optimal speed problem. 

3. Consider the observation problem. Suppose the dynamic system 
(2.1) is observed on the basis of the quantity 


y=qglt, x), (3.1) 


where y is the vector of observable coordinates y, (i= 1,..., m). 

The vector y is generally of lower dimension than x, Accordingly, it 
is assumed that x cannot be found directly from y using (3.1). The acting 
forces u (f) are assumed known. 

Problem 3.1. Given parametric equations 


6 = 6[z], r= 7 (x, 2], (3.2) 
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which determine the required quantity y and a time interval tq 1<t+6. 
The problem reduces to determining an operator @ lf, y(@@+1),u(@¢+r),0Q1T<¢ 
< 6), which reconstructs the quantity 7 for selected z from observation of 

y (t)and u (t)in the past, i.e., 


Vix(t+96), 2] =@(4, y(¢ +4), u(l+rt); Or < 8}, (3.3) 


The parameter z may be introduced with an eye to improving the quality 
of the observation (see below). A typical case occurs when 7 = 2,, which 
must be known for regulation purposes. In amore general form, one may 
require a vector operator I determining the whole vector of phase coor- 
dinates x at once, i.e., x¢+ 6) =f [¢, y ¢@4+ 17), u@+1),041< 8). Itis 
natural to generalize the formulation of the problem to systems with 
infinitely many degrees of freedom. The observable vector y (t) can be 
related to x (¢)in a more general manner; for example, a differential relation 
dy/dt = q (t, y, x, uJ taking the inertia of the measuring apparatus into account. 

Example 3.1. Consider the system of Example 2.1. Suppose the 
rotational velocity » of the beam /, is being measured and a number 6>0 
is given. The problem may be to determine operators @ (i = 1,..., 2s) which 
at each instant ¢+ ® reconstruct the coordinates &, (¢+ 6) and the velocities 
& (¢+ 6) of all the masses m, from the observed velocity » (¢ + 1) (and force 
(¢+ +)) in the preceding time interval ¢+1(O0<1< 86), i.e., &¢+6)= 
=gmipt( truth & + 0) =u p+, ut + r)i. 

4. Consider the problem of 2.1 in the linear approximation. Suppose 
equation (2.1) is of the form 


dx 
GT ~AMOe4+ B(b)u, (4.1) 
where A, B are matrices of appropriate dimensions. Using the Cauchy 
t 
formula x4) = X (®t) 2+ (xe, t)X-! (, 6] B (6) u(®) d® and (2.2), we 
te 


transform (4.1) into the equation 


G[z, t®, t)o()dt[z, t°, t}] =¢(z), (4.2) 


ow" 7®@ 


a” 
/ 


where G is expressed in terms of X and B, and the functions ov and ¢ are 
related to the required function u(#). The purpose of this transformation is 
to interpret the left-hand side of (4.2) as the result of applying some linear 
operator g, generated by the functions o and ¢, to the given element G. 

When this is done the given restriction should be interpreted as a restriction 
on the norm of g, i.e., 


p19) < 1, (4.3) 
or 


e°(~]) = min, etc. (4.4) 
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Expressing (4.2) in terms of the coordinates, 
6 
Voceattz, ee, rpg [z, &, t= cr (2), (4.5) 
) 


the CP is conveniently interpreted as a moment problem: find a linear 
operator (functional) @ which satisfies conditions (4.3) or (4.4) and 


ele} =a (i =1, 2,...) (4.6) 


Example 4.1. In example 2.1, take m,=1,s=1, /,=1, 8 =1 with the 
restriction (2.3) on the impulse P of the force u (t); if the mass is brought 
to the upper equilibrium position, the linear approximation yields equations 
(4.5) in the form 


e 
y {cht} = (ch tdf(t) =), 
: (4.7) 
@ [sh vt) = \sh td.f (+) = cy, 
and condition (2.3) in the form (4.3) is 
e 
ele) =Vldcni <i, (4.8) 


where [((t) is a function of bounded variation; d,  (*) =u (x) dt, and thus u(f 
has the form of instantaneous impulsive actions. 

According to Hahn's theorem, the moment problem (4.3), (4.6) is solvable 
if and only if the inequality |2A«<;| < p (fA,.g) is satisfied for any finite set 
{4}. In particular, in the finite-dimensional case, we wee that the CP is 
solvable for a given z when 


a(z, 2) =minp(Ag)>0 for c(z)-A=1, (4.9) 


where p(g) is the norm in the functional space over which the operator @9 is 
being considered. 

The minimum norm 69° (9°) is determined by the equality p’ (9) = Ie [z, &), 
and the optimum functional g¢ itself satisfies the condition 9° I, g°) = 
= max, (p If, g°)) = 1, for p* (@) = Ia, where g° = £ Avg is the solution of the 
problem (4.9). This condition corresponds to the maximum principle. 
Consequently, the CP is solvable under the restriction (4.3) if a (#*) = 
= max,@ (f, z) > Ilfor z@Z or if supa (f, z) > 1 for z@Z when the upper bound 
is not attained. The existence of a (f) may be proved in a fairly wide class 
of cases. 

In the finite-dimensional case the problem (4.9) may be solved 
numerically. In infinite-dimensional cases, which are of interest in 
mechanics, actual investigation of problems similar to (4.9) seems difficult 
although the general formulations remain in force. 

In examples (4.7), (4.8) the norm 9 (g) is defined by p (g) = max, (|g (*)|, 
0<+<6). Consequently, the smallest impulse P* of the force u’ (necessary 
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to bring the system to an equilibrium state is defined by P® = I/a, where 
= min, max, [|A, cht + A,sht], 0< + <6] for A,£° —1,f°= —1; the problem is 
solvable in the time 6 under condition (2.3) if «>1. The optimum action 
u° (f) is pure impulsive action u® (f) = x,6 (¢ —¢,) + 6 (¢ — 4). 
Continuing the above approach, it is also very natural to study the 
variation of the optimum force u° (f) for different forms of the norm 69° (9). 
For instance, suppose the minimized quantity is estimated by the integral 


6 
Ne (x)? ae]? = min, (l<p< oo), For p= o this limits the maximum action 
° 


max;ju (f)|== min; for p= 2 we get the minimum ''power," and for p= 1 the 
minimum impulse. The behavior of the optimum action u° () is shown in 
Figure 2. In considering the problem of optimum speed from the present 
viewpoint, it is natural to adopt the following numerical method, by which 
the maximum principle actually yields initial values of the vector » which 
ensure the system's reaching the given state. By the previous reasoning, 
the optimum time 6° under restriction (4.3) is the smallest root of the 
equation a=I1(ora>1). We therefore take some 6 and solve the problem 
(4.9). Using the resulting 4 we replace @ by an improved value 6), so that 
a@=1 is satisfied. We again solve the problem (4.9) and using the resulting 
4) correct 6 to 6, and soon. The process converges under quite general 
assumptions. Since the quantity being minimized is a norm, problem (4.9) 
can be solved, for example, by the method of descent, without serious 
complications. This procedure may also be used to obtain the motion on the 
manifold (2.2) in more general cases, a game Situation with respect to A 
and z being typical. 

5. Consider Problem 3.1 in the linear approximation. We confine 
ourselves to the case u=0, since the solution of the problem for u#0 
follows from the solution of the observation problem for a free system. 
Suppose equations (2.1) and (3.1) have the form 


F=H(t)x, y= Q(t), (5.1) 


where # and Q are matrices of appropriate dimensions. The number 6 in 
(3.2) is considered given and we assume y = p(z) x. The problem (3.1) is 
now to determine a linear operator (functional) 9 It, y ¢ + +)] such that 

q@ It, y (¢+ +)) = p (z): x (¢ + 6) for all ¢ in the relevant time interval. If the 
conditions of the problem reduce to (4.3) or (4.4) the problem again reduces 
naturally to a moment problem. In fact, we have an equation x (¢+ 1) = 

= F [t,t+x(, 9 lt, Q t+) F [t, t+) x (Ol=p (2) Fit,t+01x. Using 

the linearity of g and equating terms in the same coordinates x, (f, we again 
obtain the moment problem (4.6), where the elements gt (z, t,t] and c (z, 
are expressed explicitly in terms of Q, F, and p. Hence follow the 
solvability conditions and solution methods of the OP, similar to those 
indicated above for the CP. This is natural, since the problems are duals 
of each other. The approach to these problems by way of the moment 
problem enables us to trace this relation conveniently and to interpret the 
results. For definiteness, let us consider only cases when in equations 
(4.1) and (5.1) u'and y are scalars, Band Qare vectors 6 and q. The 
elements g” in conditions (4.6) for CP and OP can then be taken in the form 
Bio (t) =F xy, C+ 8) 7h (PLO + tbe (O + 2). BD (Im Dd Ju (tt + 8) fal + ta +0), 
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where xy, xj}, fi, f;} are the elements of the fundamental matrices of the 
systems (4.1) and (5.1) respectively, and ¢e¢ (2) = $(z) — X (0, & + 6) x, 

Co (2) = p (2). From well-known properties of the solutions of adjoint systems 
it follows that when H = —A’and 6=q (where « stands for transposition) 
for t=, the elements g” and g? coincide. Consequently, the OP for 

Y = p-x is equivalent to the CP at the point s(z)from the point x° where 
p=s—X It, t, + 8) x. 


Conditions (4.9), which follow for the CP and OP from the L-problem of 
moments, can be given an intuitive interpretation, establishing a clear 
connection with the theory of games and problems of set separation. 

6. Let us consider the OP from the following standpoint. Suppose the 
quantity y (Q is being measured, with error w(f). The quantity r@=y(Q+ 
+ w(f) will be called the total signal, w(f the noise, and y (f) the effective 
signal. We assume that an estimate of the intensity p (w) is given; in the 
sequel, for definiteness, p(w) =maxy|m (|. In general, suppose the function 
7 (t) is defined for 0< +< ® and consider it as an element of some function 
space with norm p(n). p(n) will be called the intensity of the signal  (t). 
Suppose the functions {(s) (0<+< 6) define linear operators (functionals) 


6 

on y (s) with norm p° (¢) = sup [¢ In], pm) = 1). (In our case { [n) = \n (x) def (x), 

t ° 
er @= (iat (x)|.) p* @) will be called the intensity of the signal ¢ conjugate 

e 
top. Conversely, if a function n (t)with intensity p (n)is chosen from a class 
of functions which define linear functionals n (—]on functions € (z), then the 
norm o, (§) will be called the initial intensity for p(n). The quantity 


A = supe ¢@ It, w (¢ + 1)] for p(w) < 6) will be called the error of the operator 
due to noise p (w) <6. The problem of observation with the least error 


KRA SOVSKII 


for p (w) <5 is to find an operator @ with the lowest intensity p° (¢) of the 
function generating @. We shall say that the effective signal y (¢ + t) (0< t < 8) 
carries the quantity y, if it is generated by a motion of system (5.1), where 
p-x(t+6)=y. The signal y (¢+ +) carrying the quantity y = 1 and having 
the least possible intensity p (y) will be called minimum and denoted by 

{y° ((+ +) Ip). The relations indicated on page 86 are now interpreted as 
follows. The problem of observation of the quantity p-x (¢ + 6) on the basis 
of y(¢+ +) is solvable if an only if there exists z@Z such that the intensity 
p () of the minimum signal [y° (¢ + +) |p] is nonzero for all relevant ¢. The 
lower limit A®°® of the error A for p(w) <8, when p-x (¢ + 8) is being deter- 
mined from the total signal r (¢+ +), isdetermined by A° = 6/a, a = sup, inf; (¢, 2). 
Here a (ft, z) is the intensity p(y). If sup, inf;a is attained, there exists an 
optimum operator solution @° for which A= A®. In this case the optimum 
signal solution [° (f, t) has conjugate intensity p° (¢°) = Wa (f) and it is the only 
signal with p* ({) = I/a (Q such that the operator @° generated by the function 
t° gives the largest possible value on [y (¢ + +)|pl. 

Example 6.1. In the system of example 4.1, suppose the velocity & (¢) 
is being measured on the basis of the coordinate § ( (for u = 0) and 
ep (w) = max; |w (O]. Then the minimum effective signal » (¢ + t) is a motion 
§° (x) of the system (0 < t < 0) whose final velocity is 1, and the maximum 
deviation a = max|€ (t)| during the motion is the least possible (see Figure 2). 
The optimum reconstructive operator on any motion § (t)has the form 
E(t + 0) = x,& (¢ + 0,) + ,& (¢ + 6), where 6, and 6, are the instants of largest 
deviations of the motion £° (x), |x,|+ |»,|= la, and & (6) = 1 on & (2). 

Taking the duality relation into account, we have the following assertion 
for the CP: under the restriction p(u) < I! the CP is solvable if an only if 
there exists z@Z such that the initial intensity p, (¥) of the minimum signal 
ly? (¢° + x)|e.]) 0 < + <8 ([z]) of the observed adjoint system y = p-x (¢ + 6), 

y = B’x, dx/dt = —A*x satisfies the condition p, (¥) > 1, where p=s—X [f, &+ 
+ 6) x. With this choice of z, the optimum (with respect to the intensity 

ep (u)) control signal u° (@ + +) has intensity p (u°) = I/a, where a=, (¥), and 
it is the only signal with intensity p = l/a such that the operator ¢ generated 
by the control u® gives the largest possible value on the minimum Signal q°. 

In particular, in Example 5.1 the moment u° ( with the smallest impulse 
which brings the system from state & = & = 0to state — = 0, § = v during 
time @(in the linear approximation), is the reciprocal of the smallest 
attainable maximum deviation |§ (t)| (0<+< 6) a motion & (t)of the same 
system, provided that at the final instant vé (0) = 1. The impulses of the 
optimum action w® (tf) must be applied at the same instants ¢, when the 
quantity y (xs) = § (ts) is measured in the adjoint observation problem, i.e., 
at the instants when the deviation |§€ (t)| is a maximum in the minimum 
signal y(t) = — (x). The optimum intensity p (u°) of the control signal u®is 
then proportional to the magnitude of the optimum error arising in the 
adjoint observation problem in the presence of noise w with intensity p, (w). 

Thus the optimum operator ¢° generates the largest possible ° [y°) under 
the most unfavorable conditions for the effective signal . As indicated, 
this property of ¢° follows from the solution of Problems 2.1 and 3.1, based 
on the theory of the L-problem of moments. It is useful, however, to 
interpret these results from yet another point of view. 

7. Lef us consider the process of observation of the quantity y = p-x (4 
on the basis of y () as a game played between the observer and the object. 
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Since the noise w (f of intensity p(w) <6 is fairly irregular, we assume that 
for any choice of reconstructive operator @ the most unfavorable case of w 
is possible, giving an error A =9@ (w) = 5p’ (9), i.e., the error 4 = #6 is 
determined by the given intensity p(w) < 6 and p’(@)=6. Thus the quality 
of the effective observation may be improved only by increasing the result 
of applying the operator @ to the effective signal. Accordingly, we shall 
not make further allowance for the noise wand consider the following game. 

Game 7.1. The strategies of the first player are the operators 9 [t, n (t)) 
with intensity p® (@) < 1, and those of the second player are the signals y (t) 
(5.1), where x (¢ + ®)-p (z) =}. The values of ¢,0, and z are fixed. The 
utility function is p le, y) =@ It, y (¢+)). The aim of the first player is to 
attain the largest possible value of p, while that of the second player to 
attain the smallest possible value of p. 

Game 7.1 may be studied, as usual, using set-separation theorems. In 
this case the functional @° solving the problem separates the corresponding 
sets. The game 7.1 has a saddle point (¢°. ¥), where plf°. ¥] = max, mingp, 
and 


Hip yl >pile’, 9°), BIO, YI<BIG’, 9°). (7.1) 


Consequently, qg®° and ¢ are indeed the best strategies for both players. 
The optimum strategy y°is the minimum signal (¥ (¢ + t)|p], the optimum 
strategy @°is, up to a factora, the optimum operator solution, i.e., 

g? =ag? It. y (+ 0). 

For all y (¢ + +) of (5.1), where p-x (¢-+ 8) = 1, the operator ag® gives 
exactly p-x = 1, and no more; the latter may occur in the general case due 
to the properties (7.1) of the saddle point, since one of the sets being 
separated, i.e., the set {y (¢+1)), is two-dimensional. 

Thus, we have an analogy with a well-known property of active strategies 


® 
of a mixed finite game. If the intensity p° (@) is defined by (12g) =p, then 
° 


this analogy is carried further. In fact, let Player 1 have only pure 
strategies (+ (r, @) of the form d, C= +6 (t — 6) drt, where @ varies over the 
interval {0,6). Then the optimum strategy (¢° (t) which generates the operator 
e 
¢g® ly) =(y (x) d,¢° can be interpreted as a (probabilistic) mixed strategy, 
®@ 


where the pure strategies + appear with probabilities |d,f° (6)|, i.e., 


o° (yl = (dete (8)[(y (dt (x, 99]. 


Let us discuss the connection between the above relations and the 
maximum principle. Consider, for example, the CP for the system (2.1) 
given p (uv) = max, |u (t)|= min andr=1. By the maximum principle, u° (t) 
is determined by the condition H (u® (x), p° (t)) = max, H (u (t), p° (x)= 
= max (u (t)-~° (t)-6) for |u (t)| <p (vu), where »° (A is the solution of the 
equation dy/di = —A*yp. By the duality principle, the CP corresponds to the 


6 
OP, to which in turn corresponds the game7.1, where p lu, yJ= ( Hae, y =» (0-6. 
e 
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For the type of problems under consideration, the analogy with the game7.1 
therefore points to the interpretation of the conditions of the maximum 
principle as conditions of the maximin of the corresponding game. It is 
emphasized in this case that the element »p° figuring in the maximum 
principle must satisfy the maximin condition for the game 


um lie, ») dé. 


The connection between the CP and OP and the game 7.1 stems in 
particular from the fact that the L-problem of moments and the problem 
of set separation, on which the investigation of games is uSually based, have 
similar formulations and are studied by methods based on the extension 
theorem for linear operators. 

8. Let us discuss the controllability (C) and observability (O) criteria 
expressed in terms of the coefficients of equations (4.1) and (5.1). For the 
systems (4.1) and (5.1) to be fully C and O, i.e., for the existence of a 
control u (4) and an operator @ which solve the CP and OP for any 2, x and 
p, it is necessary and sufficient that for any ¢ condition (4.9) should be 
satisfied. If A, B,H and Q are constant, the necessary and sufficient 
condition is that the ranks of the matrices {L‘} = (B, AB,..., A B)) and 
{R*} = (VY, HO’, ..., 4%"! Q’) ben. In the case of time-dependent and 
differentiable coefficients, sufficient C and O conditions in the interval 
t<+t<t+6 are that for at least one t the ranks of the matrices {L) = 
= (L,,...,Ln) and {R) = (R,,..., Ra), where L,=B, Lewy = dli/dt —ALy, 

R, = @, Rei = dR/dt —H°R,, ben. Of importance for the CP are the 
properties of uniform (with respect to @ or ¢) C and O in intervals [t, ¢ + 6] 
of fixed length 6, when it is required that the intensities of the control p° (u) 
and the operator solutions p° (g) be uniformly bounded for all © >0or t>0 
for a selected set of bounded 2x, xr and p. This is ensured if condition (4.9) 
is satisfied uniformly, i.e., «()>65>0(ora()>65>0). In the case of 
constant coefficients C and O are always uniform; in the case of variable 
coefficients with bounded derivatives of order up to a, sufficient conditions 
for uniform C and O can be formulated as follows. System (4.1) is uniformly 
C in intervals (¢@, @ + 6), if for » >0 in each such interval there exists at 
least one point t for which there exists a set of numbers j, (k= 1,..., a) such 
that 


DE Ua» TIE Lie <I) dche > DA. (8.1) 


Here / [j, ts] are the columns of the matrix {L}. System (5.1) is uniformly 
O in the intervals [#,¢+ 01, if for some p> 0 in each such interval a point 
exists such that 


Dr lies Tr Linn €)) Ade > DA. (8.2) 
Here rf (j, t) are the columns of {R}. Conditional C and O are also of interest 


for the SP. System (4.1) may be called conditionally C in the subspace N, 
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if for any * & Nand mE N there exists a control u(t) which transfers x () 
from x to x, System (5.1) is conditionally O in the subspace MW, if the O 
quantity y= p-x for pE@N. Systems are conditionally C and O in Nif 
condition (4.9) is satisfied for vectors c, corresponding to {x, x), ppeN. 
For this, in turn, it is sufficient that the subspace W be contained in the 
subspace generated by the column-vectors of the matrices {L‘} and {R*‘}, 
respectively. 

The C property is in a certain sense a negation of the invariance 
property of the coordinates of a controllable system under the action u (A). 
Consequently, the form of the invariance conditions should be the negation 
of the controllability criteria. From this point of view it is natural to 
conclude that the coordinates of system (4.1) may be invariant under u if 
the rank of {L‘} is less than n, and the coordinate c-x is invariant under u 
if the vector ¢ is orthogonal to {L‘}. 

The problem now arises of classifying all the coordinates of a linear 
system according to the C and O criteria. This problem is solved by 
canonical representation of the linear system: by linear substitution of the 
coordinates the system can be brought into a form in which all the new 
coordinates fall into four classes: {C and O}, {C only}, {O only}, {neither 
C nor O}. 

9. Let us formulate the SP. Suppose a dynamic system is described 
by equation (2.1). We shall distinguish two cases: 1) complete feedback 
information: 2) incomplete feedback information. 

In the first case the problem is formulated as follows. 

Problem 9.1. To find an equation 


g{ulm, ulm), wo, U, X, bY) = 0, (9.1) 


such that the motion x = 0,u =0 of system (2.1, 9.1) is asymptotically 
Lyapunov stable. 

The stability requirements may be extended to a given region of initial 
perturbations, including also stability in the large. The order m of the 
equation may be given (the value of m can be determined from the auxiliary 
conditions of the problem). We shall consider only the cases m= 0 and 
m= 1, since even in the case of larger m the problem is usually reducible 
to these. 

In the second case the problem can be formulated as follows. 

Problem 9.2. To find a delay differential equation 


oo =UIt, y(t+), u(t+%); —0<0<0), (9.2) 


such that the motion x=0,u=0 of systems (2.1, 3.1 and 9.2) is asympto- 
tically Lyapunov stable. 

Example 9.1. The following problem can be formulated for system (2.1): 
suppose the angular velocity »(f is being measured; the regulating moment 
u (is applied to the link /,. Find a regulation law du/dt = U (t,»p (¢ + 9), 
u(¢+ 9), (—1<6< 0), which ensures asymptotic stability of the system 
in the position &=&=—0O(i—1,...,5). Stable realization of the state 
E = & =0 using only »( and u (is impossible, unless an aftereffect is 
introduced in the regulation law. 
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10. Consider Problem 9.1 in the linear approximation, for m=1. The 
problem is to select a regulation law 


= Aix + Buu, (10.1) 


for which the motion x=0,u=0 of systems (4.1, 10.1) is asymptotically 
stable. In general, the SP does not have a unique solution. A convenient 
approach to the problem, which singles out one definite solution, consists 
in parallel solution of the problem of analytical construction of the regulator 
(10.1), which minimizes the integral 


L=\o (t, x(t), u(Q), u(Q} de, (10.2) 


CB 


where o is a quadratic form. This problem can be solved effectively, 
obtaining simultaneously a Lyapunov function vu (¢, x, u) satisfying the 
condition du/df = —w [t, x, u, u] which can be used in the further development 
of the problem. A sufficient stabilization condition for a system with 
variable coefficients is the existence of a number ® >0 such that system 
(4.1) is uniformly controllable in each interval [¢, © + 6} (¢ > 0) (in the case 
m= 1 sufficient regularity of B(¢t) must also be assumed). The stability of 
a closed system is then found to be uniform; this is important in passing to 
a nonlinear system. If (4.1) has constant coefficients, the known structure 
of the motions makes it possible to formulate a more precise stabilization 
(S) criterion, with a clear geometrical meaning. In the process of stable 
regulation in the present case, it is sufficient to control the intrinsically 
unstable coordinates of the object. TheS criterion therefore reads as 
follows: in order that the system (4.1) be anS regulator (10.1) for A = const 
and B =const, it is necessary and sufficient that the root subspace K {7°} of 
the matrix A, corresponding to its eigenvalues 47° with nonnegative real 
part, be contained in the subspace generated by the column-vectors of the 
matrix {L‘}. Here and below the term root subspace K {A4,}) corresponding 
to a set {4}, means the following: let T be a matrix which transforms 

into Jordan form G -T“AT. Then K {A,) is the linear subspace generated 
by the columns 7, with the same indexes as those of dA, in the matrix G. 

If A is simple in structure the subspace K {A,} is generated by the corre- 
sponding eigenvectors S, of the matrix A. 

The stabilization criterion can also be given a somewhat different form. 
Let A,,...,A, be all the eigenvalues of the matrix A with Rea, >0, and 
suppose each eigenvalue 4; = —A, of the conjugate matrix A° has q, 
eigenvectors Sj’. System (4.1) is stabilized by the control (10.1) if and only 
if the rank of each matrix L'? = Sf? & is q , where 6 are the columns of the 
matrix B. 

Consider Problem (9.2) in the linear approximation. 

A sufficient condition for stabilization of system (4.1) by the regulator 


° 
= | y(t + 0)deA, (t, 0) + u(t + 9) doB, (¢, )) (10.3) 
a) 
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is that in any interval [¢,¢+ 6] this system is uniformly C and uniformly O. 
Here too the stability of a closed system is found to be uniform. 

If A, B and Q are constants a more precise Stabilizability criterion may 
be formulated, since it is then sufficient to control only the unstable 
coordinates and to observe only those coordinates necessary for such 
control. 

Let L [j;,..., jm) (m <r) denote the linear subspace generated by the 
column-vectors of the matrix ({L‘} with indexes j = jf, + ir (G=0,...,(2 —1)) . 
We denote by R the subspace generated by the column-vectors of {R} (for 
H=A). System (4.1, 5.1) is an S-regulator (10.3) if there exists a set of 
numbers j, such that K (47°) CL l[j,,...,jm] CR. Let us formulate another 
form of the criterion (for definiteness we assume that u and y are scalars). 
Let A,,.......,Am be all the eigenvalues of the matrix A with Reda, >0, 
one eigenvector SU” corresponding to the eigenvalue 4; = —A, of the conjugate 
matrix A*. If all the scalar products 6S‘ and gS” are nonzero, the system 
dx/dt = Ax + bu, y= qx, is an S-regulator (10.3). The nature of the functions 
A, (4 and 8B, (®) in (10.3) is determined by the function space p (n) in which 
the corresponding L-problem of moments is being solved. 

11. Let us discuss the CP, OP and SP for nonlinear systems. A general 
theory of nonlinear C, O, and S problems does not seem to exist at the 
present time. One may speak only of the theory of C, O, and S of quasi- 
linear systems, obtained by investigating the problems in the linear 
approximation. One may then establish an analogy with the theory of 
Stability in the first approximation and with the method of small parameters 
in the theory of oscillations. 

Consider the systems 


& = Act Bu+fit, x, ul, y=Qetait, x), (11.1) 


f= Act Butps (t,x, un), y= Qe+ngle, x, p). (11.2) 


System (11.1) will be considered in the neighborhood of the point x= 0, 
u=0, assuming that the order of smallness of fand g with respect to x and 
u is higher than one; system (11.2) will be considered in a finite, sufficiently 
large region, assuming yp to be a small parameter. In the event that we 
expand the solutions in series, the functions / and q will be assumed 
analytic. The basic result regarding the CP, OP, and SP for systems 
(11.1) and (11.2) is: if the systems are completely controllable, completely 
observable, and stabilizable in the linear approximation, then the quasi- 
linear systems are alsoC, O, andS; the quasilinear solutions may be 
expanded in series (for example, for (11.2) in powers of »p). When solving 
the SP of (11.1) and (11.2) in the nonstationary case one should check the 
uniform C and O of the linear approximations. 

The above cases will be called noncritical C, O, and S cases for (11.1) 
(11.2). As an example we may consider the problem of an unbalanced beam 
(Figure 3). The equation of motion has the form @P&/d# = —psin&E+u. In 
the linear approximation the system is C andO. Therefore for any initial 
conditions €°, n° = —°, a control u (t) = Tus (t)p* can be constructed which brings 
the beam to the state §—=0,7n=0. 

One may constructs an operator which reconstructs the velocity n (¢ + 6) on 
the basis of the coordinate §(¢+ 1) 0<1< 86), nm (¢+ 6) = Dptq, (4, E(t + 2), 
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u(¢++)), and finally construct a regulator dwat= Ip*/, lt, &,n, uJ which 
stabilizes the beam in the position § = n = 0 and minimizes, for example, 

eo 
the quantity (8 +n? +o + Zyros (¢, & 0, u)) df. 

0 

It is interesting to note that the dependence of the solutions of the C, O, 

and S problems on p may turn out to be fairly regular even if the solutions 
themselves are fairly irregular, forexample, discontinuous. Asanexample, 
we consider the problem of the fastest relaxation of an unbalanced beam in 
the lower stable equilibrium position by a moment u (f of bounded magnitude, 
max;|u (f)|< 1. The phase plane of the optimal system has the form depicted 
in Figure 3, where.the trajectories of the optimal system for » = 0 are 
shown (the solid curves represent the motions, the broken curves, the 
isochrones, i.e., the curves separating regions of attraction of equilibrium 
positions which differ from each other by a period x). The corresponding 
curves for small p are also shown. The equations of these curves have the 
form of series in p; the equations of the lines for p = 0 are generators. 


< 


We mention that it is also possible to formulate a certain controllability 
criterion for nonlinear systems of a more general form than (11.1, 11.2). 
Namely, if the first approximation system at the point x= 0 is fully 
controllable for dx/dt = f [t, x,y), and in the neighborhood of each motion x (ft) 
generated by a control u (f) of a certain class this system is uniformly 
completely controllable, then for each point x of the given region G there 
exists a control which brings the motion x() to the pcint x=0. For fairly 
well-behaved systems this criterion can be expressed as conditions on the 
partial derivatives of/dx,and Of/du. Ifitis satisfied and the existence of u (A 
as a function of time is affirmed, we may ask whether it is possible to 
synthesize a stable system with a sufficiently smooth control law u = g (x). 

This question is partly answered by the following results. If for the 

LJ 
system dx/dt = fi (x) + /™ (x) u the problem of optimal control \ ig (x (9) + 
6 
+ g) (x (0) u® ()) df = min has a unique solution in the region G(the problem 
has a solution wnen the above criterion is satisfied), it is possible to 
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synthesize a system stable in G with a smooth control function. The latter 
results, however, are more of the nature of existence theorems, since 
actual construction of u () following the proof involves serious difficulties. 

12. Cases in whichC, O, or S systems (11.1), (11.2) cannot be solved 
in the first approximation are naturally called critical. 

We shall only consider crticial S-cases for system (11.1) in the case of 
complete feedback information. The basic result can be given a clear 
geometrical formulation. 

Consider again the matrix T which reduces A to Jordan form and the 
space W{L‘} generated by the columns Lj. If all the vectors 7, correspond- 
ing to Rea, >0 are in W then the unperturbed motion x= 0 of the system 
dx/dt = Ax + Bu + f{ [x, u) is stabilized by the regulator u = g (x) = Px inde- 
pendently of f; if at least one vector 7, with Rea, >0is not in W, stabilization 
is impossible; if all T; with Rea, =Oare in W, and atleast one vector T; with 
Re > Ois notin W, then the possibility of S and the solution u = g [x], if it 
exists, is determined by higher order infinitesimals of f. Thus, there is a 
full analogy with Lyapunov's stability criteria on the basis of the first 
approximation, the problem being solved here naturally by that part of the 
spectrum of the matrix A of the first approximation whose root vectors 7, 
are not contained in W. These decompositions of the space are consistent, 
since both W and the subspaces generated by the vectors 7, are invariant 
subspaces of the linear approximation system. 

The investigation of critical stabilization cases is developed in analogy 
with Lyapunov's well-known theory. In this case, however, the possible 
variation of the regulator u, even within the limits of linear control u= Px, 
introduces essential new features. 

13. Let us discuss the form assumed by the above relationships in the 
case of controllable mechanical systems. We confine ourselves to motions 
of holonomic systems in the neighborhood of equilibrium, assuming that 
only one control action is applied to the systems and that there is only one 
observable coordinate (u and y are scalars). Suppose the system is 
described by generalized coordinates q (i= 1,...,s), which vanish in the 
equilibrium position. We shall say that the system is controlled by the 
coordinate gq, if the term depending on u appears in the Lagrange equation 
corresponding to this coordinate, and that it is observed on the basis of 
g,, if y=q,. In the sequel it is convenient to assume that in the first 
approximation the kinetic energy is expressed in terms of the coordinates 
gq: as a sum of squares. We reduce the system in the linear approximation 
to normal coordinates §& n= & and obtain & =, n, = 4% +a;u. The 
conditions of complete C assume the form 4; #4; and a;#0, i.e., the system 
is completely C by a single coordinate gq, if and only if all its eigenvalues 4A, 
are distinct and the axis g, is not orthogonal to any canonical axis & of the 
characteristic motions of the free system. Similarly, a system is completely 
O on the basis of the coordinate g, if an only if all the numbers 4A,are distinct 
and the axis g; is not orthogonal to the characteristic canonical axes &. A 
system is completely O on the basis of the velocity g,, if in additionto the 
above conditions there are no zero eigenvalues A,(Figure 4). In particular, 
under these conditions there always exists a control in the form of a finite 
set of pulses u (f) = £x,5 (¢ —f&), and any coordinate gq, or velocity 9; can be 
reconstructed by measuring the observable coordinate gq, = y (t) or velocity 
g; at discrete instants ¢,. The stabilization conditions assume the following 
form: 1) If 4¢=4, anda; #0, the system is stabilizable by the control 
um D (6.9; + ve); 2) if Aye A), a; 0 and the axis gq is not orthogonal to 8, 
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(or also 4; #0), the system is stabilizable by the control du/d = 
° ° 


= ( (qn (¢ + 8) doB (0) + u (f + 9) de y (8)} (or du/dt = { (9 (t + ©) de B (0) + u (¢ + 9) 
pe | -6 


Xde y (0)}). In particular, the system in Example 2.1 is Sturmian; all 

the necessary conditions are satisfied, and it can therefore be stabilized by 
a single force u or moment u in the neighborhood of the upper unstable 
equilibrium position, including the case when the only source of information 
is measurement of one of the coordinates &, or the velocities —&,. 3) If 

A, =’, or some a, = 0, then if A; >Othe system is unstabilizable; if 4,< 0, we 
have acritical (S) case, and approximations of higher orders must be investi- 
gated. The situationis similar in the case of C, O, orSby several coordinates. 


Let us consider how the principle of duality between C and O is 
transformed here. Suppose we have a system & =n, mn, =A,&. The adjoint 
system has the form  ; = —A,;n. n. = —&, and after renaming the 
coordinates and the velocities and changing the sign of the time we obtain 
the initial system. According to the duality principle, therefore, a system 
is controllable by a coordinate g, if and only if it is observable on the basis 
of this coordinate. 

It is of interest to study the stabilization of a mechanical system by for 
forces of specific types, and also to study the influence of forces of different 
types on the C, O, andS properties. The problem of stabilization of 
mechanical holonomic systems by dissipative forces has been studied in 
detail. From the viewpoint of general stabilization theory, the results may 
be interpreted and extended as follows. Suppose the system is stable for 
u=0, but not asymptotically. It is S up to asymptotic stability by forces 
of arbitrary nature if and only if it is S by dissipative forces. A system is 
S by a dissipative force on a coordinate q if and only if it is observable on 
the basis of this coordinate. S by a dissipative force is interpreted as the 
analytical construction of a regulator (lo (9 (f)) + u® (t)) dt = min, the minimiz- 

@ 


ing functional having precisely the meaning of dissipated energy. 
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Application of dissipative and gyroscopic forces usually improves the 
C, O, and S of a system. Consider for example the system of Figure 1, 
but assume that it is not two-dimensional, but each link has two degrees of 
freedom and may rotate in an arbitrary direction. Then some of the 
eigenvalues 4; will naturally be equal, and the system will not be C andS 
by a single force § or O on the basis a single coordinate &. If, however, 
one introduces, say, a dissipative force in the lower link along an axis 
forming an acute angle with the direction of the force u or with the direction 
of the moment u in the horizontal plane, the system will be C, O, andS. 

The system also becomes C, O, and S if a gyroscopic effect, due to a 
flywheel rotating about the axis /,, is introduced. 

14. Consider the stabilization of systems with aftereffect, described 
by equations with time delays (delay equations). The discussion of this 
problem is useful from two points of view. On the one hand, systems with 
delay are essentially systems with infinitely many degrees of freedom, and 
therefore involve some of the difficulties associated with such systems. 
At the same time, systems with aftereffects possess many regular 
properties by dint of which they resemble systems described by ordinary 
differential equations. On the other hand, when considering the OP, systems 
of delay equations acquire a certain new interpretation. Here we shall only 
discuss the SP, assuming for definiteness that the control u is a scalar. 
Suppose the system is described by delay differential equations 


> = Ax(t) + Ayx (¢ —6) + bu. 


An eigenvector of the system (14.1) corresponding to the eigenvalue 4, 
is a nonzero vector S, such that x (9 = Se satisfies equation (14.1) (when 
u=0). The eigenvector corresponding to the eigenvalue 4, of the conjugate 
system is a nonzero vector S; satisfying the equation 


B= — (Ax!) + Aix (t+ 8)), (14.2) 
for x () = Se"'. When investigating the SP for system (14.1), one should 
separate from the system, considered in the function space x (t+ +) 

(—8 | + < 0), the finite subsystems which describe the behavior of unstable 
components of the motion. In these subsystems the force u(t) ensures S 

of system (14.1). We then obtain the following S-criterion. Suppose each 
eigenvalue 4; with Rea; <0 has only one eigenvector S; (otherwise the 
system is unstabilizable by a scalar control u). System (14.1) is S if and 
only if none of the scalar products 6S; for Rea; <0 vanish. The S-criterion 
generalizes to the case of a vector control u, as well as to the case of 
incomplete feedback information. in forms similar to those indicated above 
for systems of ordinary differential equations. The first-approximation 
S-theory can be developed similarly. Let us discuss another problem 
related to systems with delay. Suppose a dynamic system is described by 
the equations dx/df = Ax-+ Bu, y= Qx. We shall consider one possible 
approach to the construction of a regulator with aftereffect 


glu (t), ..., u(t), u(t + 6), y(t + 0)) = 0, 
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which stabilizes only the observable state y(f) of the system. This system 
can be compared to the delay equation dy/dt = p ly (¢ + 6), u (¢ + 8))J, which 
describes the motion (variation of the signal) y(f). The sufficient conditions 
for stabilization of this equation, derived as before, are sufficient conditions 
for stabilization of the motion y (6. 

Consideration of this problem leads to the following interpretation of 
delay equations. Consider the system 


dx/dt = Ax, y = Qx. (14.3) 
This system can be compared to the delay equation 
dy/dt = p{y(¢+ 6)] (-8< 6 < 0), (14.4) 


which describes the observable motion y (4). The nature of the right-hand 
side of equation (14.4) (discrete or continuous distribution of delays) is 
determined by how the norm of the operator solution is selected when the 
L-problem of moments is used for solving the OP. Conversely, every 
delay equation (14.4) belonging to a fairly wide class can be regarded as 

an equation describing the observable motion y (t) of some system (14.3); 

in particular, of a finite-dimensional system describing the variation of the 
elements of the canonical expansion of the motion (14.4), which correspond 
to some set of eigenvalues {A,}. From this point of view it is useful to 
consider the indicated canonical expansion. In addition, this method 
provides estimates characterizing the coefficients and properties of the 
motions (14.3) and (14.4), as well as estimates characterizing the approxi- 
mate solution of problems of analytical construction of an optimum regulator 
for the system (14.1), when it is approximated by finite-dimensional 
canonical expansion systems. 
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A.M. Letov 
OPTIMAL CONTROL AND STABILITY 


I. FORMULATION OF THE BASIC PROBLEM 


§ 1. In place of an introduction 


The concept of Lyapunov stability of motion remains the most fundamental 
in modern science, particularly in the field of automatic control /1—8/. 
During the past 70 years this concept has undergone the most intense 
development; a particularly large contribution to stability theory after 
Lyapunov is due to the Soviet school of N.G. Chetaev. 

The importance of stability theory was enhanced anew with the develop- 
ment of the theory of optimal control (TOC). This theory arouses great 
interest among engineers and scientists /9—27/. Its essential goal is to 
work out rigorous algorithms with a view to realizing the optimal operating 
schedules of machines, factories, economic regions, and any other purpose- 
ful man-machine combinations. 

The search for optimal forms of control is now being applied to ever 
more complicated processes in diverse fields, whether they are determinate, 
stochastic or involving game situations. 

Here we shall consider the simplest processes describable by ordinary 
differential equations. 

A profound relation between the optimality problem and Lyapunov 
functions has been established recently for such processes /128/. Examples 
of this relation were indicated in /29—31, 34, 75/. Thanks to this relation, 
Lyapunov's stability theory, which had hitherto studied only purely topolo- 
gical properties of control systems, has now assumed a quantitative aspect. 
The latter is characterized by a functional with respect to which the system 
is optimized. The present paper deals with this relation. 


§ 2. Brachistochronic motions 
Let us consider controlled objects describable by equations of the form 
xa X(x, t, u). (2.1) 


Here x{x,,...,%} is the state vector of the object, wis a scalar (the 
control function), and ¢ the time. 
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The vector X of generalized forces expresses the physical laws governing 
the controlled object. X is assumed defined in some region 


N(x, t, uy >0 (2.2) 
of the phase space x,u for 
tell, T) (2.3) 
or 
t G[to, 00). (2.4) 


It is appropriate to base the TOC on Lyapunov's concept of perturbed- 
unperturbed motion and to divide all optimal control (OC) problems into 
two essentially different classes /9, 27, 29, 30, 32, 34/. To the first 
class belong programming problems of brachistochrone motions, formulated 
as follows. Given equation (2.1), the region (2.2), (2.3), and boundary 
conditions represented symbolically by parentheses. 


(i, f) =9. (2.5) 


A class of comparison functions x and u is given (for example, for x, the 
class of piecewise-smooth and for u that of piecewise-continuous functions) 
on which the criterion functional (CF) is defined: 


® = D(x. u, ty’. (2.6) 


Functions x,u satisfying (2.1). (2.2), (2.3), (2.5) will be called 
admissible. ~ 

The programming problem. Among the permissible x, u, to find those 
which minimize the functional (2.6). 

It is assumed that there exists a unique solution of the form 


x=x°(t, (i, J), uu [t, (i, A). (2.7) 


We call equations (2.7) brachistochronic since they are expressed in 
parametric form, i.e., as in the solution of Bernoulli's famous problem 
267 years ago. 

The prograinming problem is interpreted here as a mixed problem of 
dynamics in which the forces and motion are given partially and it is 
required to find the unknown forces and the motion itself. 

Mathematically speaking, this is an ordinary (classical or nonclassical) 
variational extremum problem, and a huge and very interesting literature 
is devoted to its solution. 

This problem is not considered in the present paper. In order to 
characterize it at least to some extent, we list the best known monographs; 
papers, of which there are more than 40,000, are not indicated. 
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§ 3. Basic problem of the TOC 


We shall consider /33—35/ brachistochronic motion as Lyapunov 
unperturbed motion. This makes it possible to formulate another OC 
problem, essentially different from the first, which is in our opinion the 
basic problem of the TOC. 

An obvious and easily verified fact is that brachistochrone motion (2.7) 
cannot be realized by a control of the form u = u*, since conditions (2.5) 
cannot be satisfied exactly. The true motion will therefore be described 
by functions: 


rs=x°+n, u=u'+6, (3.1) 


where n {n,,...,) is the Lyapunov perturbed motion, and 4 (0). — (0) are the 
initial perturbations. The equations of the perturbed motion have the form 


n= Byn+mt+8(n, &, +9(0), (3.2) 


in which the right-hand sides are series expansions of the functions X. 
In particular, these may be Taylor expansions, in which the matrices 


ax, {I° 

B=[by|= a, 
ax, ||* 

m = |r| =i ar 


are defined for the motions (2.7). These equations are defined in the - 
region (2.2), (2.3). 

We note that in formulating the basic problem there is no need to assume 
that the motion (2.7) is brachistochronic. It may be any motion agreeing 
with equations (2.1), which we wish to impose on the system. 

For any 7 (0) =, constrained (possibly) by the condition 


Di (0) < A, (3.3) 


where A is a given number, and a given function € (, equations (3.2) 
determine what is called a transient process in automatic control theory. 

One of the principal problems of automatic control theory is to control 
a transient process by Suitable definition of §. It is required to find both 
the method and the means of cancelling a transient process in a manner 
which is, in some sense, optimal. In particular, it is desirable that for 
t=T: 


=ni (T) < 4, (3.4) 


where 6 is a specially selected number. 

We therefore introduce a certain measure estimating the deviation of the 
true motion (3.1) from the programmed one (2.7), or a measure estimating 
the quality of the transient process. For this purpose we select a class G 
of comparison functions, on which we define the functional (3.5). From the 
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mathematical viewpoint this class should be as wide as possible, but its 
size may be limited by practical considerations. 
Let the functional be 


T 


J@=r lw & E Nat + swin(T), E(T). Th, (3.5) 


where W and @ are nonnegative and sufficiently smooth functions of their 
arguments, and r,s are nonnegative numbers. 

The simplest problem can be formulated without allowing for the specific 
form and features of the current technological means on which the realization 
of the required solution will be based. 

The basic problem. It is required to determine a differential equation 


FL... 8, &, gE. n, t) =0, (3.6) 


over the class G of admissible comparison functions, which expresses the 
control law, such that a controller constructed in accordance with the law 
(3.6) and attached to the object (3.2) guarantees that the functional (3.5) is 
a minimum for all motions starting in (3.3). This is the problem of 
synthesis (or analytical design) of optimal controllers. 

Remark 1. Inequality (3.4) is in general not a boundary condition. 
The expression Zn? (7T)is itself a functional, and (3.4) only expresses the 
natural desire to bring the representative point of the system to some 
restricted neighborhood of its state f. In the particular case when by,, 

m, = const or are periodic functions of ¢, the idealization 7 = o is admissible 
for the functional (3.5). We may then put 6=0, and condition (3.4) means 


Na(0c) = 0 (k= I1,...,2), (3.7) 


i.e., the controller (3.6) must also ensure asymptotic stability. This is 
concomitant with the boundedness of the functional (3.5) for T= oo. If 
nevertheless T < oo, then by setting 


‘= poh (3.8) 


we reduce the basic problem to the former form. Thus, even for bounded 
T, we may formally speak of the stability ensured by the controller (3.6). 
In a sense, this will mean monotonic damping of some function V (y, , 9, 
positive for ¢e€ (0,7). 

In this case we can only guarantee the existence of a number 8< 4A such 
that inequality (3.4) is satisfied. Of course, in other respects this reduction 
is formal in character, and it does not essentially exclude investigation of 
the solutions of the problem. 

Remark 2. If W does not contain derivatives of —, equation (3.6) 
degenerates and becomes algebraic: 


F Ik, nt] = 0. (3.9) 


We shall say that this defines an ideal controller, i.e., one which 
requires a source of energy of infinite power. 
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By appropriate change of variables the basic problem may also be 
reduced to the design of an ideal controller in a number of more general 
cases. 

Remark 3. In formulating the basic problem we have assumed that 
the mathematical model (3.2) has the property of ''complete controllability 
and complete observability' /75/ or the property of ''complete stabiliza- 
bility’ /50, 72—74/. The requirement of ''complete observability'’ may be 
weakened. As regards ''complete controllability or ''complete stabiliza- 
bility, '' these seem so natural and obvious that there is no question of 
weakening them. 


§ 4. General control scheme 


Suppose that the control law (3.6) is known. The general block-diagram 
of a control system is given in Figure 1. 


Conrolled 
object 
Controller 

F0 


Programmed 
device 


The brachistochronic motion is located in a programmed device. Such 
a device can always be built by using clocks. It emits signals which proceed 
to an adder. They affect programmed deviation of the control unit u’; an 
additional deviation & is established by a controller, which utilizes informa- 
tion n on the true state of the controlled object. The latter consists of a 
suitable measuring apparatus which operate in accordance with the algorithm 
inherent in (3.6). The diagram represents the general idea of the current 
theory of automatic control. 


4 


§ 5. Selection of the criterion functional (CF) 


The choice of the CF is governed by our subjective estimate as to what 
is desirable in the specific case in the question. 
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For s=0 the functional (3.5) is a measure of smoothness of the perturbed 
motions n,§ along the brachistochrone. The presence of suitably chosen 
expressions in W depending on £,§ is aimed at minimizing the energy 
consumption of the controller. 

In terminal control problems the functionals (3.5) are studied for r= 0. 
They characterize the measure of deviation of the system from the terminal 
state f at the instant T/11, 23, 47—49, 56/. 

At present, investigations are concerned mainly with quadratic functionals. 
From the normal point of view, a pre-selected functional should be regarded 
as apostulate determining the character of the resulting control theory. And 
as in geometry, each preassigned functional is associated with its own theory 
of optimal controller design. Just as at the dawn of its development mankind 
selected the simplest and most intuitive geometry — Euclidean geometry — 
for its first science of measurement, our first steps in the theory of optimal: 
controller synthesis are based on the simple and intuitive quadratic functional. 

Other important grounds for studying quadratic functionals will be 
discussed later. However, whatever the CF, its minimization means that 
the corresponding unperturbed motion will have an optimum measure of 
Stability in the adopted sense. Consequently, the functional characterizes 
the quantitative aspect of the stability of the unperturbed motion. 


§ 6. Time optimal control and terminal control 


If we put 6=0, rW = 1, s=0 in (3.4) and (3.5), we arrive at the time 
optimal control problem. For'r=0 we obtain the terminal control problem 
/11, 12, 47—49, 56/. Although formally both problems are included in the 
general framework of the basic problem, there are nevertheless grounds 
fer their separate treatment in the sequel, in connection with the method of 
solution of the basic problem. 


ll. PROGRESS IN THE SOLUTION OF THE BASIC 
PROBLEM (BP) 


§ 7. The general situation 


The synthesis problem is new in comparison with the programming 
problem; it stems from the requirements of modern engineering and its 
historical development. On the basis of experience and intuition, engineers 
constructed, to the best of their ability, controllers of the form (3.9), and 
later of the form (3.6), mainly in order to maintain desirable steady-state 
conditions of the controlled objects. Their intuition revealed universal 
control principles: the principle of control by deviation (n,&), the principle 
of control by perturbation @(f, and the feedback principle. 

In principle, science is now occupied only with a search for improved 
and more accurate forms of these principles. The simplest example of 
improvement is the current TOC. In more complicated cases, in accordance 
with the cybernetic conception of N, Wiener, we are striving for perfect forms 
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of these principles in adaptive systems. The desire for optimality is in 
itself not so much commendable as necessary, in view of the extremely 
limited material resources at mankind's disposal. 

An effective solution of the synthesis problem has been or may be 
obtained in a small, though nevertheless impressive, number of cases. 
Before proceeding to their description, we briefly describe the mathema- 
tical methods used. In so doing we shall be guided by the experience 
reported in the literature as to the measure of success of these methods. 

We sincerely apologize to the mathematicians for such a primitive 
approach to evaluation of these methods. For applied scientists such an 
approach is inevitable and it is justified by the desire to obtain, as quickly 
as possible, an expression of the concrete content of the problem in 
analytical form. Evaluation of the potential possibilities of the methods 
remains the task of the mathematicians themselves. The most we can 
indicate here amounts only to the substance, possibly, of inadequately 
justified guesses. 


§ 8. Methods of variational calculus 


The first, simplest, synthesis problems were solved by methods of 
variational calculus /33/. The mathematical school of L.S. Pontryagin 
has developed these methods considerably and given them an elegant 
description in the form of a single theorem, the so-called ''maximum 
principle" /9/. 

The maximum principle has brought about improved understanding and 
solutions of modern complicated variational applied problems defined in 
closed spaces, and has been used extensively to solve programming 
problems. An important contribution to the development of variational 
methods is due to V.A. Troitskii /99—104/ and V.F. Krotov /91/. Varia- 
tional methods possess also a number of other important advantages over 
other methods. For example, they yield an unambiguous proof of the 
following fact. If the function W in (3.5) contains an m-th order derivative 
of &, equation (3.6) is also of order m. However, practical application of 
the methods involves difficulties which have not yet been overcome, mainly, 
the need to express the auxiliary Lagrange multipliers in terms of the 
phase coordinates x. These multipliers are not related to the content of 
the problem and have no part whatsoever in its formulation. The famous 
two-point boundary-value problem therefore stands in our way as afortress 
attacked many times, but as yet unconquered. 


§$ 9. The Lyapunov-Bellman method 
The method of dynamic programming (DP) or that of V. F. Krotov seems 
more attractive in a number of more special cases /10—16, 25, 27, 91/. 


The DP method involves investigation of a certain partial differential 
equation for the function 


T 
V (n, ) = Sirw + s8(t—7) w) dt, (9.1) 
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which we shall call the criterion function. This equation is the mathematical 
expression of Bellman's optimum principle. The DP method based on it is 
applicable to the investigation of many optimum processes. Of course, this 
method also introduces certain difficulties. For example, in the solution 

of the basic problem its scope is limited to functions V which are continuous 
and piecewise-smooth with respect to n and £, and this cannot be ascertained 
in advance. Fortunately, however, when applied to the basic problem the 
DP method is closely related to the direct Lyapunov method of constructing 
V -functions for the stability problem. The relation is as follows. Asa 
rule, it turns out that the optimum principle is satisfied by some set of 
criterion functions V (y, 4 /34/. The solution of the problem is given only 

by those which are Lyapunov functions for closed-loop systems. Such 
functions are called optimum Lyapunov functions (OLF) /29, 30, 34, 43, 58/. 
Controllers based on such functions are optimal and ensure Stability of 
closed-loop systems. 

The Lyapunov method therefore picks out the necessary solutions from 
those given by Bellman's optimum principle. It is this combination of the 
two methods which yields an effective solution of the synthesis problem. 
Moreover, Lyapunov's rigorous and fundamental concept of perturbed- 
unperturbed motion is essential to the very formulation of the synthesis 
problem. In its essence, this concept, developed 70 years ago, was a 
natural basis for the development of classical control theory, and is now 
becoming the foundation for modern control theory. 

We shall formulate the basic relations of the method in the form of 
sufficient optimum conditions /29, 43, 58/. Let the equations of perturbed 
motion have the form (3.5), definedfor |[nJ<R, ¢e€ (0, 0): R >0 isa given 
number. 

By a Suitable choice of equation (3.9) for —, we must minimize the 
functional (3.5) Is = 0, T = oo] over all the motions in the region |ng< Rk, <R, 
so that the state n =0 of the closed-loop system is asymptotically stable 
for all n, such that Jn.J< R,. It may turn out that R, =R= oo. Then, if 
there exist a function V° (n, t) and a function & (n, satisfying equation (3.9) 
such that: 

1) they are defined in the region }nJ<R: 

2) the function V° (n, 4 is positive definite; 

3) at the point »n =0 the function V admits an infinitesimal upper limit: 

4) Sup [V*, jnl< Rol < inf [V*, In] = RI: 

5) the derivative of V° along the trajectories of the system closed by 
the controller (3.9) satisfies the equation 


(Ve. =-—V (n, e; t) (9.2) 


and the optimum condition 
Ve + W (nm, &, t= min (Ve+ (ng 0) = 0, (9.3) 


then £° is an optimal control, and we have the following equation: 


V°(no, 0) = min) Wim, & A) dt. (9.4) 
© 
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If equation (3.9) has been found, the stability of the closed-loop system is 
ensured by conditions (1—5), since V° is a Lyapunov function, and the 
optimality of the controller follows from equation (9.3), which expresses 
R. Bellman's optimum principle. 

For T < oo, the principle formulated above is still meaningful, if the 
definition of the function V° also utilizes the boundary condition 


V°(n, T) = so, (9.5) 


and "stability is interpreted as in § 3. 


§ 10. Solvable cases of the synthesis problem 
(analytical design) /29—63/ 


Let us first consider steady-state motions. We assume that: 

1) there are no perturbing forces: 9 () = 0; 

2) the right-hand sides of equations (3.2) and the function WV may be 
expanded in entire series in y convergent for |jn||<cp < R,; 

3) the equations are linear in § and W is a quadratic function; 

4) the domain of definition AN (x, u) of the problem is open. 

These assumptions correspond to the relations 


Th = >) OneNe + Dy Ceenene +...+ mf, 
e ef 


(10.1) 
V = 2) 2idapnans + D2) DCoaonete +... +8, > 0. 
a per 
Then the OLF is represented by a series of forms: 
Vev (on tv" (On O+.-.-+= DV (nm, 0, (10.2) 


where m is the order of the form, m= 2,3,.... The control law is deter- 
mined by the formula 


b=— =m a. (10.3) 


Series (10.2) and (10.3) converge for ||n||<( p, where p is sufficiently small. 

To find the forms V™ we must solve systems of recurrent algebraic 
equations, the principal of which is a system of quadratic equations of the 
form 


Ox Gap +2 DibreAnp — (2 mane) (D maAss) - (10.4) 


It depends only on the coefficients of the linear part of equations (10.1) and 
the quadratic part of the expansion of the function W, and determines the 
coefficients of the form Vi. 
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The coefficients of the remaining forms are determined by solving the 
corresponding systems of recurrent linear equations. The latter are 
always solvable if system (10.4) can be solved. This determines the 
importance of the simplest problem —that of the synthesis of linear systems 
whose criterion functional is quadratic. 

Under the conditions of the present problem vanishing perturbing forces 
can be taken into account. The equation for § then corresponds to the 
principle of combined control, containing signals for both the deviation y 
and the perturbation 9 /44—46/. 

For nonsteady-state motions defined for ¢e (0,7], or for W depending 
on ¢t, the optimal Lyapunov function (10.2) and the solution (10.3) have the 
same form. Now, however, the quadratic equations (10.4) become Riccati 
equations 


— Ap = Gap + 2 » breAns — (2) 7.Are) (2 mA) . (10.5) 


Their solutions determine the coefficients of the form V®,; the coefficients 
of the remaining forms satisfy recurrent systems of linear differential 
equations. 

These systems must all be integrated, starting from a definite value of 
the function V for t=T, from T to 0 /60/. The results admit effective 
generalizations to more complicated dependence of the equations and the 
function W on €, and to the important case in which the vector m depends 
on yn and ¢ and is representable by a convergent power series in n. 

These constructions raise a number of mathematical problems, specific 
for the theory of differential equations. An example is the following /35/. 


Given the form Vi = S)})Aaxex, and the system of equations As, = fx (An. . 


--+Ana, 4, it is required to determine necessary and sufficient 
conditions on the functions /,, and a region of initial values A,, (0), for which 
the form V® is positive definite. This problem was solved recently by 
N.P. Erugin /98/. 

In conclusion we note that the Lyapunov-Bellman method and all the 
results described above, may be rigorously generalized to hereditary 
/50, 72, 74/ and stochastic processes /64—71, 76—84/, which can be 
synthesized in conditions of only partial observability. 

If the basic problem is defined in a closed space WA (x, u) > 0, an additional 
mathematical problem arises, concerned with a more detailed investigation 
of the properties of the function V, in particular of the continuity properties 
of V and dV/dx, in the whole space NW (xu)>0, and its uniqueness. We shall not 
go into this here. 

Thus, the Lyapunov- Bellman method provides a unified basis for the 
solution of the synthesis problem for optimal controllers, i.e., the problem 
of technological means ensuring optimal measure of stabilizability for any 
unperturbed motion. From the aspect of application, the method is of 
interest, since it allows for all possible, previously determined, technolo- 
gical aspects of the synthesis problem, such as the choice of technological 
means for realizing the controller. 
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III. INVERSION OF THE SYNTHESIS PROBLEM 


§ 11. Formulation of the inversion problem 


It has thus been proved that if a criterion functional is given in the basic 
problem, optimal control is provided by an optimal Lyapunov function V°* 
satisfying Bellman's optimum principle. It is of interest to examine the 
role of Lyapunov functions from the opposite direction: can any system for 
which a Lyapunov function is known be considered optimal, in some sense? 

Let us delineate the inversion problem more accurately. Let us assume 
that in the region WA (x, u) > 0 the differential equations 


n=f(n, t, p) (11.1) 


of the perturbed motion of some closed-loop system are defined, where 
PAD yo ee ,» Pm) are parameters, to be chosen in some region S, such that 
for each pe S the state 


v 


n =0 (11.2) 


is asymptotically stable. 

For a wide class of functions / (n, ¢, p) there exists a nondenumerable set 
of Lyapunov functions by means of which asymptotic stability may be 
detected /1—8/. Let V (nm, be sucha function. By virtue of (11.1) its total 
derivative 


Se et Dahan ft, p) (11.3) 


is a negative definite (or negative semidefinite, if it ensures asymptotic 
stability). 

The question is, given any pe©S, can we interpret system (11.1) as 
optimal, i.e., as the result of solving the synthesis problem for some 
other auxiliary system with a criterion functional whose minimum is 


Vimo 0) = \U(n, t, p) de. (11.4) 


What is the possible form of the auxiliary system and the criterion 
functional? As we see, the inversion problem is a typical inversion 
problem of variational calculus known even to Darboux /97/. 


§ 12. First approach to the inversion problem 
The synthesis problem involves three functions, V, W, &. If Wis given, 


we have the basic problem. Given V or € we obtain two different forms of 
the inversion problem. The first is the more general. 
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Consider the surface V (n, f) =C and the vector 


gradV = VO) hn (12.1) 


Its direction is that of the external normal nto the surface V=C. Now 
let us define a nonzero vector m (n, f) {m,,..., Ma} by 
(m(n, t)-grad V) = 0. (12.2) 
This vector always exists if the surface V (n, / =C has no singular points 


for any ¢ and C. 
Now let us consider the synthesis problem for the auxiliary system 


Th=fe(n, + ome(n, 6, (12.3) 
JE)=l\Uin H+tala Oe) de, (12.4) 
0 


where a (n, is positive for all n,f. By the Lyapunov-Bellman method the 
synthesis problem is solved by the equations: 


— Bau tat? + Di (fat mat) xe. 


(12.5) 
ay 
0 m 2a + >) Me an, 
Here » denotes the function 
vin, H=JIU (nO + a(n, 6) a. (12.6) 
t 
This function satisfies the partial differential equation 
op _ oy { oy \2 
—3z =U(n + Bhs — qa(B ane) (12.7) 
By virtue of (12.2), equation (12.7) can be satisfied by putting 


In this case the control ¢=0. Hence the theorem: any system of differential 
equations of the form (12.7), for which asymptotic stability is ensured by 
some Lyapunov function V (yn, 9, may be interpreted as optimal, i.e., as the 
result of solving the synthesis problem for some auxiliary system (12.3) 
whose criterion functional is (12.4), if there exists at least one nonzero 
vector m(n, 4 perpendicular to grad V. 
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Remark. The fact that the control ¢ vanishes identically is unimportant, 
since the role of control can always be transferred to a new function 
defined by 


where m* is an arbitrary nonzero vector not perpendicular to grad V. 
Let us establish an important geometrical property of the optimal 
Lyapunov function V° (n, ¢). 
To this end we define an operator 


D(w) = (grad ¥) my — 4a [U (n, N+ FE + Dg]. 


Any positive semidefinite functional p = V° (n, 4) will be called admissible 
if D(V’) > 0 in some region containing the point n=0. Then, if V’(y, dis 
any admissible function, the corresponding controller equation 


— (grad V*) m+ VD (V*) 
(ny, ) = —ee eee" 


obtained by solving the first equation of (12.5), is admissible. 
In fact, if system (12.3) is closed by the above control, the given function 
¥° is a Lyapunov function for it, since 


= — Ua, t) + ab) 


by virtue of the trajectory of the closed-loop system (12.3). Consequently, 
the control {° (n, f) determines an admissible vector n (). We then have the 
theorem: let V°(n,f be an optimum Lyapunov function; then for any 
admissible V (n, t) we have the inequality 


voin,g KV (ny 2). (12.9) 


Indeed, the optimum function satisfies equation (12.7). Referring to 
equations (12.5), it can be interpreted formally as the condition that the 
surface V’=C be the envelope of the family of admissible surface y° = 
with respect to the parameter (°. Inequality (12.9) then follows from the 
fact that V’ and V’ are Lyapunov functions for the corresponding closed-loop 
systems, V° being optimal. 

Alternatively, we can construct the following auxiliary synthesis system: 


m= fa(n t+ st Dm met mk, (12.10) 
se) = [Um \— ae (Say) + elm Oe], (12.21) 


where m is an arbitrary nonzero vector. 
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It is assumed that the integrand in (12.11) is positive. This is always 
true if 


4 8 
Vin >a (Dm a): (12.12) 


This inequality is bound up with the rate of decrease of the function V (m, ¢) 
along a trajectory of the system, i.e., with the rate of approach of the 
representative point to the unperturbed motion. Therefore, if there exists 
a positive number 6 such that 


Bx OES |x (fo) Per, (12.13) 
there exist estimates for the function V and V, characteristic for quadratic 
forms /5/, and inequality (12.12) is preserved. Consequently, the integral 
J (&) may still be interpreted as a positive measure of stability of the 


unperturbed motion. The equations of the Lyapunov-Bellman method then 
assume the form 


OW 1 ov \? 
— U(r, O— & (Ima) +a(n, te + 
my, 5) OV fs J 
+3 [fe + Ft Dag me + mk ae, (12.14) 
we Ov. 
0 — 2ak + 2m ay, ’ 
where » denotes the function 
C 1 eV \3 
vin = \ [Yin N—e(Ymar) tain ola. (12.15) 
a 
The function »=V satisfies equation (12.14); the control § is defined by 
the formula 
= — we Lz, - (12.16) 
To summarize, in many cases we can construct a nondenumerable set 
of auxiliary systems whose synthesis yields the given system (11.1). For 


all these systems, however, the minimum value of the criterion functional 
in the synthesis problem is unique and is defined by (11.4). 


$ 13. Second approach to the inversion problem 

An alternative approach to the inversion problem is based on explicit 
selection of the control &(n,4. We are given the equations of perturbed 
motion of the controlled object 


Te = fan, t, b) (A=wil,...,A), (13.1) 
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and a control law 
E=E(n, 2. (13.2) 


It is required to find all the functions W (m, ¢, 8) for which the integral 


Je) = (wn t, Bae (13.3) 
e 


is a minimum. 

For the case of linear and stationary equations (13.1) and (13.2) the 
inversion problem has been considered by Ya. Kurtsveil' /39/ and R. Kalman 
/96/. 

It is easy to see that substitution of § from (13.2) in (13.1) reduces the 
inversion problem to the form just studied, since the closed-loop system 
(13.1), (13.2) is asymptotically stable. 

However, this formulation of the problem may be of some practical 
interest in itself. The point is that the control law (13.2) is often based 
on various considerations of a purely practical nature. In these cases it 
is interesting to ascertain the possible meaning of the optimality of the 
closed-loop system (13.1), (13.2). 

The inversion problem is easily solved in this formulation if optimality 
can be established on the basis of the sufficient Lyapunov-Bellman conditions. 

Indeed, let us try to find a function W of the form 


Vint, E) = Un, tht o(n HE 4), (13.4) 


where 9,/ are known positive functions. 
The sufficient optimum conditions are 


— HU +H +The, (13.5) 
eV 
0 = 9(n, jy 4 Ot Se. (13.6) 


If §&(n, f (13.2) is an optimal control, the corresponding Lyapunov function 
is 


V (n, f) = min 7 (n, t, &(a, )) dé, (13.7) 


which satisfies the partial differential equation 


= - 
R(n, N+ BYR = 0% (13.8) 
where we have denoted 
df _ f Fe 
R= [et een Y= Ee 0° (13.9) 
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Knowing even only one function (10.27) satisfying equation (13.8), equation 
(13.5) yields a function W (m, t,  (n, which establishes the meaning of the 
optimality of the control (13.2). The function VW must be positive semi- 
definite, or positive definite. 

It may occur that for some choice of the functions /,@ the solution of 
the inversion problem is unique. 


§ 14. A practical result 


Suppose an asymptotically stable system (11.1) is given. Consider some 
positive semidefinite function U (n, f) and define 


e(p) = \U (n, t) dt (14.1) 
e 


as the criterion function of the transient process (11.1). Then, using the 
given U (n, f) we find the Lyapunov function from the Lyapunov equation 


a tDhiw 5 =—U(y #). (14.2) 
From the foregoing 
t(p) =V(m, 9, p). (14.3) 


The criterion function depends essentially on n,. We can give it a more 
convenient, invariant form, by considering the hypersphere I]n,§= 1 and 
calculating the integral 


(0) = \V (ne 0, P) duio, - - - + AMno. (14.4) 


It depends only on the parameters p. Hence the possibility of minimizing 
e(p) on S, or looking for inf e on S, as an integral quality criterion of the 
transient process. 


§ 15. Differences of opinion with Dr. R. Kalman 


In /96/ R. Kalman gives the investigation of the inversion problem 
preference over the solution of the basic problem (BP). In his opinion, the 
BP has no practical meaning, since the criterion functional is selected on 
an unreliable and subjective basis. 

We cannot agree with this point of view. As already indicated above, 
there is no need to justify the choice of the criterion functional. We need 
only know, as in every synthesis, what properties we wish the system to 
have, and this is the basis of our choice. The content of the BP is not 
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reduced merely because we do not yet have this knowledge. If the criterion 
functional possesses the required properties (and other properties are 
immaterial), we can find the optimal control law, which is unique in most 
cases of practical interest. 

In the inverse problem we have no alternative choice other than the 
natural effect of the system (11.1), whose quality is expressed by the given 
number (11.4). Kalman's point of view therefore implies intentional 
rejection of possibilities which we can always effectively use if we are 
capable of distinguishing good from bad or better from good. 


The author is grateful to N.N. Krasovskii for his effective assistance 
in preparing this paper. 
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DEVELOPMENT OF THE METHOD OF LYAPUNOV 
FUNCTIONS IN STABILITY THEORY 


The object of this report is to extend Lyapunov's second method /1/ by 
Simultaneous employment of several functions V, and of higher order 
derivatives of V, in the criteria for stability of motion. When this is done, 
the demands imposed on each function V are less stringent than those 
figuring in the classical Lyapunov theorem or in its generalization, which 
involve single function V and its first derivative. This facilitates the 
search for the required functions V and makes the method more flexible. 

The majority of existing criteria originated in applied problems of 
stability of mechanical systems. Applications, however, are beyond the 
scope of this paper. Neither do we deal with other aspects of the develop- 
ment of the method of Lyapunov functions in the theory of stability of motion 
(or with the qualitative theory of differential equations). In particular, we 
do not touch upon the well-known generalizations of the theorems of 
Lyapunov's second method, in which the function V is determined in 
"piecewise'' fashion in separate regions covering the neighborhood of the 
unperturbed motion (which are in general also interpretable as stability 
theorems involving several functions V). Neither do we discuss extensions 
of the results to systems describable by equations other than the customary 
ones. 

The selection of the material was largely determined by the interests of 
the author and has no claims to completeness. 


Introduction 


§ 1. Given a system of equations of perturbed motion in vector form 


dx 

a =X, 2b). (1.1) 
Here x = {x,,..., X} is a vector in Euclidean n-space E£* with norm 
jxp= Vet... +; X (x0 = (X. (x, 9,..., Xn (x, 9) is a function vector; 


X (0,1) =0, i.e., system (1.1) admits the unperturbed motion x=0; the 

real functions X, (x, are defined and continuous in the region’: |zJ< 4,t>0: 
(H =const >0 or H =o) and satisfy the conditions of uniqueness, nonlocal 
continuability, continuous dependence on the initial conditions of the 
solutions of the system in the region f. Thus the perturbe ‘i motions are 
described by a function vector x(t, x, 4), which is defined and continuous 
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for {x,t} GT, t © (4, 0) as long as [xJ<(_H°, and continuously differentiable 
with respect to ¢. 

Consider real functions V, (x,4), . ...Vs (x, ) defined and continuous in the 
region Ir together with their first derivatives with respect to ¢, the latter 
satisfying the equations of perturbed motion 


. eV, eV, t 
V, (x, th= ss 243 ti 


Xi(z, ) (s=1,....8)s 


and vanishing in the unperturbed motion, i.e., V, (0,0 =0,V,(0,9=0. For 
these functions V (x, t) = {V, (x, 4,...,Va (x, 4) we introduce a norm |V] = 
= YVi} +... + Val. 

If the function V, (x, has partial derivatives up to the second order with 
respect to x,,..., x, which are continuous in IF, and the functions X; (x, f) 
are continuously differentiable, we can introduce the second derivative of 
each function V, with respect tos, in accordance with the system (1.1): 


vs = t) 


Vi, ya ane > Xi (x, 0). 


Higher -order derivatives are defined analogously. The derivatives of 
the functions V, will always be assumed continuous in [ by virtue of the 
equations of perturbed motion. 

We shall also consider realfunctions V (x), V’ (x), defined and continuous 
in the region(H):|{x|< Hof the space £*,V (0) = 0, V’ (0) = 0. 

Let us denote the set of points x © (H) for which V‘ (x) = 0 (or V’ (x) >0), 
by EWV° =0)(or EV’>0)). The set of points {x, ¢} @T for which V (x, =0 
(or V (x, 4) > 0), will be denoted by BV=0)(or BV>0)). 


Instability 


§$ 2. After the fundamental work of A.M. Lyapunov (1892) /1/, the first 
important step in the development of the method of Lyapunov functions 
in the theory of stability of motion was made by N.G. Chetaev /2/. 

It was his idea to set up stability criteria with two functions V and use the 
second derivative of the function V. 

The second derivative of the Lyapunov function was already used in 1902 
by Duhem to obtain a criterion for equilibrium instability of dissipative 
systems /3/. In the course of his investigations in /3/, Duhem in effect 
proved the following theorem for autonomous systems 


dx/dt = X (x). (2.1) 


Duhem's Theorem. If there exists a positive semidefinite function 
V (x), whose first derivative V together with V may assume positive values 
in any neighborhood of the unperturbed motion x=0, and the second 
derivative V is positive semidefinite, then the unperturbed motion is 
unstable. 


* Inthe case H =o, the solutions x(f, ze, te) are defined and continuous for all ¢€& [, oo). 
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In the paper ''On the Inversion of Lagrange's Theorem" (1930) /21/, a 
far more general theorem was proved. 

Chetaev's First Theorem. Suppose the function V (x) for system 
(2.1) possesses the following properties. 

1. In the region E (V > 0) defined by the inequality V >0, the function V 
is also positive. 

2. Near the boundary surface V = 0 of the region E (V>0), the second 
derivative V has a fixed sign, and has the same sign or vanishes on this 
surface. 

Then the unperturbed motion x = 0 of the system (2.1) is unstable. 

It is clear that the meaning of the demands imposed here on the second 
derivative V is that they ensure the existence of perturbed motion with 
arbitrarily small initial perturbations, which cannot leave the region 
E V > 0) without moving away from the unperturbed motion to a given 
distance. This is illustrated in Figure 1, in which near the surface V = 0: 
a) V>0: b) V<0. For the general case, when the function V also depends 
explicitly on time, K.P. Persidskii introduced the concepts of sector and 
semisector /4, 5/ to characterize such a situation. 


a b 
v<0 
Vv=0 
v>0 
v=0 


v<0 


Let us consider a subregion A of the region |x] < 4,t >0(0 <A = const < H) 
such that its intersection A (st) with any hyperplane ¢=+t (t>0)is an n- 
dimensional domain (open and connected), its closure [A (z)) containing the 
point x= 0 and at least one point with |x|= 4. 

Persidskii's definition. A region A is called a sector (semi- 
sector), if for any positive number e (e< A) there exists an interior 
(interior boundary) point (x, f} (0 <x < e) such that the integral line 
Xx (t, X, fe) Temains inside (inside or onthe boundary of) the region A for all 
t>t. for which jx (f, %, to) l|<( 4. Let us try to characterize case a) of the 
figure. 

Definition 2.1. The region A is called an absolute sector (resp. 
absolute semisector), if for any interior point (x, f)@A (resp. interior or 
boundary point {x»,t,}& [Al)the integral line x (t, x. /,) remains inside (resp. 
inside or on the boundary of) the region A for all ¢>4, for which 
Wx (t, Xo, bo)]< A. 

It is clear that the existence of a sector or semisector can be ensured 
by different methods, in particular, by the introduction of an auxiliary 
function W with suitable properties. 
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Modifying his first theorem in this way in the papers /2/ ''A Theorem 
on Instability" (1934), ''On an Idea of Poincaré" 1934, "On the Instability 
of Equilibrium, when the Force Function is not a Maximum" (1938), Chetaev 
published his famous two-function instability criterion (for nonautonomous 
systems (1.1)). 

Chetaev's Second Theorem. If the differential equations of 
perturbed motion are such that: 

1) for some function V admitting an infinite small upper limit there 
exists a region where V-V> 0; 

2) for some «x of arbitrarily small absolute value, there exists a 
subregion of the region (V-V > 0) in which a certain function VW >0, on whose 
boundary W=0, and the total time derivative W has a definite sign; then 
the unperturbed motion is unstable. 

One feature is worth mentioning. Chetaev considers here the infinite 
closed interval of time variation,'' completing it by a singular point +o, 
and by the existence of some region in (#) means its nonemptiness for any 
f of the extended numerical half-line. 

Problems concerned with the instability criterion were subjected toa 
further careful investigation by Persidskii in his dissertation /4, 5/. 

Following Persidskii, we call the set of all the boundary points of the 
region A which do not belong to the cylinder ||x\|/= 4A the lateral surface ¢ 
of the region A. 

Persidskii's Sector Theorem. If the integral lines intersect 
the lateral surface ¢ of the region A in one (in strictly one) direction, then 
A is a semisector (sector). 

Corollary. Suppose there exists a function W (x, 4 which is positive 
in the region A and vanishes on its lateral surface ¢. 

(I) If W(x, 0 <0 for {x.1) Gl, x0, then A is a sector. 

(II) If W(x,’ >0 for {x, 4 Gt, x0, then 4 is an absolute sector. 

If the derivative W (x. f) is such that each point {x,f}@t, x+0 hasa 
neighborhood U such that: 

(III) W (x,t) <0 for (x,t) ©U() A, then A is a semisector; 

(IV) W(x.) >0 for {x,4} GUN A, then 4 is an absolute semisector. 

Persidskii's sector theorem was somewhat generalized by A.D. Myshkis 
/6/. He also established that the set of initial points of trajectories which 
cannot leave a sector or semisector through its lateral surface (with 
increasing time), supplemented by the origin, contains a continuum which 
connects the origin with the surface of the sphere ||x||= of any radius 
O<pP<A. 

The concept of the sector yields the following theorem /4, 5/. 

Persidskii's Instability Theorem. Suppose there exist a 
sector A and a function V (x. t) with the following properties in A: 

1) V (x, ) is bounded; 

2) V(x, >0 (for x +0); 

3) Viz. ON >n WV (x, 9. 4, 
where » (V,f) is a nonnegative nondecreasing function of V, for which 


(nt t)di=oo forany a>0. 
0 
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Then the unperturbed motion x= 0 of system (1.1) is unstable. 
Here one may avoid the requirement that the derivative V be semidefinite, 
by replacing condition (3) by 


3*) Vix, )>O()V (x, 0, Tim [0(t) dt = 0. 
1-000 9 


By making the requirement for the existence of a sector and conditions 
(3) and(3*) more specific, one can obtain many instability criteria convenient 
for applications. In particular, Persidskii's criterion generalizes the 
classical theorems of Lyapunov, Chetaev's well-known one-function theorem 
/2/, and many criteria obtained by Alma Ata mathematicians /10, 11/. 
Other instability criteria result from theorems of N.P. Erugin /7/, Masséra 
/8/, and V.I. Zubov /9/. 

Let us now consider another possible choice of the second function V in 
the instability criterion. 

For time-periodic functions X (x, in system (1.1), N.N. Krasovskii 
obtained /12/ the following generalization of Lyapunov's theorem. 

Krasovskii's Theorem. Suppose there exists a function V (x, é) 
having the following properties in Lr: 

1) V(x, 4 is periodic in f¢; 

2) for any ft, >0 there exist points x, in an arbitrary neighborhood of the 
unperturbed motion x = 0 for which V (xp, f,) > 0; 

3) V(x. >; ; 

4) V vanishes, if at all, only at points of a set MC (H) which does not 
contain complete semitrajectories x(t, X, t,), ¢€& [t, oo) other than x=0. 

Then the unperturbed motion is unstable. In this case, if the set M 
is a surface defined by an equation F(z,f) =0, then condition (4) is satisfied 
if F (x, #0 on this surface for x+0. 

Here F (x, f) may obviously be regarded as an auxiliary function V which 
eliminates’ trajectories from the ''dangerous' set M. 

§ 3. In this section the functions X (x, 4 in (1.1) are assumed bounded 
in Lf. 

It can be easily shown by an example that Krasovskii's theorem in the 
given form cannot be extended to the general case of nonstationary motions, 
when the right-hand sides of the equations of perturbed motion are non- 
periodic time functions. Nevertheless, the author has shown /13/ that 
appropriate modifications may be made in Krasovskii's theorem. 

We introduce the corresponding definitions of /13/. 

For any positive numbers a, A (0<a<A<H) we define the subregion 
Agi a <|[x]|<CA of (H). : 

Definition (3.1). W(x, #0 strictly (definitely) in the set E (V° (x) = 0), 
if in any region A, there exists a neighborhood of the set A. () E (V’' = 0) and 
a continuous function € (f)(a positive number &), such that for any ¢ & [0, 09) 


E(t) >0,  [e(n) dt = 0, (3.1) 
t 


and | W (x, |> & in the neighborhood. 
Definition 3.2. A continuous nonnegative function 9 (fis called 
positive in the mean, if for any infinite system s of closed nonoverlapping 


128 


MA TROSOV 


intervals of the half-line [0, oo) of equal positive length we have 
‘9 (t) = oo. 
r | 


The conditions of Krasovskii's theorem can be modified as follows: 

1) omit the requirement of infinitely small upper limit; 

2) retain completeness; 

3) add: V(z,N >e (OV (x), where V(x) >0 and 9 (f is positive in the 
mean (in particular, if 9 (4 = const >0, this means that the position of the 
set M in the phase space is fixed); 

4) suppose there exists an auxiliary bounded function W (x, 4 for wntch 
W (x, + 0 strictly in the set E(V = 0). 

Then the unperturbed motion is unstable /13/. 

Here condition 3) can also be maintained in the form V>0, if in addition 
we require that V(x, 4) and V (x, 4 be bounded, and in condition 4) that the 
derivative W (x, t) be strictly nonzero in the set B(V = 0) in the following 
sense. 

Definition 3.3. W(x. +40 strictly (resp. definitely) in the set 
BWV =0), if for any region A. there exist a positive number / and a 
continuous function § (9 satisfying conditions (3.1) (resp. a positive number §&) 
such that |W (x, 9|>€ for x@Ae (1, OEB (WV (x, O1< 0. 

Let us now combine the two directions of investigation described above. 

Theorem 3.1. Suppose there exist a sector dA and functions V, (x, 4 
and V, (x, 4 with the following properties in [A): 

1) V, (x, admits an infinitely small upper limit; 

2) V, (x, t) > 0 for {x, he A, x #0; 

3) V, (x, 0 >@ OV’ (x), where V’ (x) >O for xe (H) and 9 (is positive in the 
mean; 

4) V, (x, ) is boundedand Y, (xz, #0 strictly in the set E(V’ = 0) [ u A (t)}. 


Then the unperturbed motion x= 0 of the system (1.1) is unstable. 
Here too condition 3) can be maintained in the form V; (x, 2) > 0(for 
{x,t} ] [A)), by requiring that V, (x, 4) and V, (x, be bounded and replacing 

condition 4) by: 
4*) V,(x,t) is bounded and J, (x, #0 strictly in the set B (V, = 0) f [Al. 
As already mentioned, if V, (x, >0in some region A, V,=0Oon its 
lateral surface {, and for {x, 4} @B(V,= 0) 1) B (0 <(||x||<( 4) the derivative 
V, (x, & satisfies one of the conditions: 


(I) Vs (x, t) < 0, (IT) Vs (x, t) > 0, 


then A is a sector. 

Thus, as a consequence of Theorem 3.1 we obtain a three-function 
instability criterion which generalizes, on the one hand, Chetaev's second 
theorem, and on the other, Krasovskii's theorem. 

Setting V,=V,=V,, we obtain a corollary which is a modification of 
Chetaev's first theorem: 

Corollary 3.1. Suppose there exists a function V (z, 4 with the 
following properties in Ir: 

1) V(x,f admits an infinitely small upper limit, and V (x, M and V (x, are 
bounded; 
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2) there exists a region A in which V (x, >0, and V (z, 4 = 00n its lateral 
surface ¢: 

3) V(x.) >O for {x, HE A: 

4) V (x,t #0 strictly in the set BV =0)=¢; when for (x,f)E@l, x= 0 one 
of the following conditions is satisfied: 


(i) Vix, H<0, (II) Vix, > 0. 


Then the unperturbed motion x =0 of system (1.1) is unstable. 

§ 4. A completely different approach to the whole problem is possible. 
The conditions imposed on the derivatives of the function V in the Lyapunov's 
second method are simple differential inequalities. Thus, in the classical 
Lyapunov instability theorems they have the form V>86(V) @(V) >0forV>0 
/5/, and in Persidskii's theorem the form of inequality 3), 3*). 

It is natural to try to set up an instability criterion using more compli- 
cated differential inequalities and systems of differential inequalities. 
Though such criteria necessarily contain requirements on the functions V 
which are more complicated to verify, they may turn out to be more 
general; this enables us to use V-functions which do not satisfy the require- 
ments of the well-known instability theorems. 

The proofs may be based on Chaplygin's theory of differential inequalities 
/15/. Conditions for the solvability of Chaplygin's problem have now been 
obtained for systems of differential inequalities. Thus, in the well-known 
theorem of Wazewski these conditions represent monotony and continuity 
of the right-hand sides of the differential equations (and the corresponding 
inequalities) /16/. 

An instability criterion with several functions VY, whose derivatives V 
satisfy certain differential inequalities, was proposed by the author in /17/ 
and generalized in /18/. Here we formulate a criterion of this kind, in 
which the concept of sector is again involved. 


Consider real functions /, (V, x, t),..../. V, x,t), defined and continuous 
in the region Q:|VJ< 0, [xJ< A,t>0WV = {V,,....., V,}). In addition 
f, (0,0, t) =O (s=1,..., &). 
Let &() = (&, &), ..., Ea (Q} DE a Set of real functions which are continuously 


differentiable for? & [0, 00). We set up an auxiliary system of equations 


HHA 8, (S=1.. sb (4.1) 


where y = {y,,.. -. Ya}. 

Definition 4.1. The system (4.1) possesses the + y, -quasi-non- 
boundedness property for the functions V (x, and a region A if for any 
positive numbers e,4,,a,A (Q<a< 4A <A) and any continuously differentiable 
function & (f) such that {& (¢,), 4} & 4, OC PE@M)I<aE (0) <A for t& [y, 0), 
there exists a positive number 7 such that any solution y (#) of system (4.1) 
with initial conditions y (t,) = V (§ (4), %), defined for ¢€ [%,%,+T7), also 
satisfies the condition y, (+7) >e. 

Theorem 4.1. Suppose there exist a sector A and functions 
V iGO es e5 , Ve (x, with the following properties: 

1) V, (x, 9 is bounded in 4; 

2) Vil D>hV (49,29 (S=1,....8) for (x. SA; 
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3) each of the functions /, (VV, x, 4) (defined and continuous inQ) is non- 


decreasing in V,,...,Vs-1,Vse1... -,Ve /18/; 
4) the auxiliary system dy,/dt = f, (y, &§(), (s=1,..., 4) has the + y,- 
quasi-nonboundedness property for the functions V,,...,V, and the sector A. 


Then the unperturbed motion x= 0 of system (1.1) is unstable. 

When we have exact equalities V, =/,(V, x. for (x, t}e A in 2), condition 
3) may be omitted. 

For k= 1 and f not depending explicitly on x, we obtain: 

Corollary 4.1. If there exist a sector A and a function V (x, f) bounded 
and positive in A whose derivative V (x,t) > /(V (x, 4), ) for (x, eG 4, where 
f(V, ) is continuous and such that any solution of the equation dy/dt = 
= f(y, 4 (0, 9 = 0) with the initial condition y, >0, 4, > 0 increases without 
limit with increasing time, the unperturbed motion x =0 of the system (1.1) 
is unstable. 

This proposition generalizes both variants of Persidskii's theorem. 

Corollary 4.2. Suppose there exist a sector A and a function V (x, 4 
bounded in A, whose second derivative V (x, ) > aV (x, ) + 20V (x, for 
{x,t} 4, where a,6=const,a>0, and 6>0 if a=0, and V (x, + V (x, 1x 
x(V 6? + a — 6) >O for {x,t}G A. Then the unperturbed motion x= 0of the 
system (1.1) is unstable. : 

In particular, if V and V may assume positive values simultaneously for 
any ¢ in an arbitrarily small neighborhood of the unperturbed motion, and 
V>O0for (x,) GBV>0,V>0), BION Ed =h)#¢, then B is an absolute 
sector. Taking V bounded in Band a=0,6=0, we obtain the instability 
criterion of /17/. 

This criterion generalizes Duhem's theorem and its modification proved 
by Kh. I. Ibrashev /19/. 

Thus, by specifying the conditions of Theorem 4.1 in various ways, 
different constructive instability criteria can be obtained. 


Stability 


§ 5. Lyapunov suggested using some "part'' of the neighborhood of the 
unperturbed motion to investigate conditional stability. 

If an absolute sector A is available, generalizations of Lyapunov's 
stability and asymptotic stability theorems to the case of conditional stability 
for {x,, f.} = A are obtained trivially by requiring that all the conditions 
imposed on the function V be satisfied only in [A]. Obviously, the absolute 
sector can then be defined by an auxiliary function W. In this way two- 
function conditional stability theorems are obtained /20/. 

It is more interesting to determine the class of systems (1.1) for which 
the existence of a function V (x, f) possessing the relevant properties in the 
absolute sector A provides information about the behavior of the perturbed 
motions in a complete neighborhood of the unperturbed motion. Such a” 
class of systems has been determined by P.A. Kuz'min, in the form of 
linear systems with continuous and bounded coefficients /20/. 

The proof is based on the fact that any solution of a linear system is a 
linear combination of a solutions of a fundamental system, which can be 
taken from A. The existence of an absolute sector is ensured by introducing 
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an auxiliary function W, and so Kuz'min's theorems are two-function 
Stability criteria. 

The auxiliary function W may be incorporated into the asymptotic stability 
criterion in another wav. 

In 1952 E.A. Barbashin and N.N. Krasovskii showed /21/ for autonomous 
systems (2.1) that, under the conditions of Lyapunov's asymptotic stability 
theorem, the derivative V (x) need be only positive semidefinite, not neces- 
sarily positive definite, if the derivative V (x)may vanish only in a set M 
which contains no integral semi-trajectories other than x = 0. 

An analogous theorem was published later by A.P. Tuzov /22/ and 
generalized by La Salle /23/. Krasovskii has extended the theorem to 
periodic systems /12/. 

Here, as in the case of instability, the set M does not contain integral 
trajectories if it is a surface F (x, = 0on which F (x, f) #0for x+0; F too 
can be regarded as an auxiliary V-function. 

§ 6. In this section the functions X (x, 9 in (1.1) are assumed bounded 
inf. 

As before, examples show that the theorem of Barbashin and Krasovskii 
in its given form cannot be extended to nonperiodic systems. The author 
has modified it to obtain two-function asymptotic stability criteria /13/. 

Theorem 6.1. Suppose there exist functions V, (x, ) and V, (x, #) having 
the following properties in Ir; 

1) the function V, (x, 4) is positive definite and admits an infinitely small 
upper limit, 

2) the derivative V, (x, <@ (QV (x), where V° (x) <0 and the function ¢ (4 
is positive in the mean (resp. a positive number); 

3) the function V, (x, t) is bounded; 

4) V(x. #0strictly (resp. definitely) in the set E (V° = 0). 

Then the unperturbed motion x = 0 of the system (1.1) is uniformly 
asymptotically stable with respect to x,(resp. with respect to xp, f)). 

Condition (2) can be maintained in the form V, (x, f) < 0 by requiring the 
boundedness of the derivatives V, (x, 4) and V, (x, and replacing condition (4) 
by (4*): V, (x, ) #0 strictly (resp. definitely) in the set B (V,=0). 

Theré is an analogous theorem for nonuniform asymptotic Stability. But 
then the requirement of an infinitely small upper limit is imposed not on 
the main function V,, but on the auxiliary function V,, which thus possesses 
the following property. 

Definition 6.1. The function V, (x, f) admits an infinitely small upper 
limit in the set EV = 0)if it is bounded, V, (x,t) = 0 for xG@&E(V = 0), ¢ € [0, o) 
and for any arbitrarily small number / >0 and any region A, there exists 
a neighborhood of the set A. 1) E V° = O)in A, in which |V, (x, t)|<¢ / for 
t & 10, oo), 

If V (x) is assumed negative definite in the above theorem, the auxiliary 
function V, is not needed, and we obtain a one-function criterion for asymp- 
totic stability which generalizes Lyapunov's criterion in two directions. 
First, the requirement of an infinitely small upper limit is eliminated, and 
second, the definiteness requirement for the derivative V, is somewhat 
relaxed. 

This method of generalizing the Lyapunov criterion is due to Krasovskii 
/24, 12/, V.P. Marachkov /47/, Zubov /9/ and Reissig /25/. 

Both the above two-function asymptotic stability theorems and the 
theorem of Barbashin and Krasovskii may be extended to the case of global 
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asymptotic stability, if the conditions of the theorem include an estimate 

of the region of attraction of the unperturbed motion, and also to the case of 
asymptotic stability in the large, if the requirement of an infinitely great 
lower limit of the function V, is introduced. 

We shall formulate a nonuniform asymptotic stability theorem (including 
the above-mentioned estimate), defining the properties of the functions 
only in some absolute sector A. We shall obtain a modification, on the one 
hand, of Kuz'min's theorem, and on the other, of the theorem of Barbashin 
and Krasovskii. 

Theorem 6.2. Suppose there exist an absolute sector A and functions 
V, (x, t), V, (x, 4) which possess the following properties in (Al. 

1) V, (x, is positive definite and for some positive numbers 4, 
A(<A<A< A) Satisfies the inequality sup (V, (x, 4 for {x,t} E (A), IlxIlCAlL< 
< inf IV, (x, for (x,t) GIA), |] x|)/= Al: 

2) V, (x, 2) << (ft) VV’ (x), where V* (x) <0 for x © (AH), the function 9 () is 
positive in the mean; 

3) V; (x,t) admits an infinitely small upper limit in the set EV =0)f 


N tu 4 (r)1; 


+ © (0. 


4) Vs (x, +0 definitely in the set EWV°=0) MN 1[U 4 (I, 
te [o, ce 


Then the unperturbed motion x = 0 of system (1.1) is conditionally 
asymptotically stable for ¢, & (0, oo), xe 4 (t,) ME (lx ||[< A). In this case, 
if system (1.1) is linear, the asymptotic stability is unconditional. In 
particular, when the absolute sector A is defined by an auxiliary function 
W (x. 1), we obtain a three-function theorem. 

§ 7. Stability criteria using several functions V can be obtained on the 
basis of the theory of differential or integral inequalities, and also by fixed 
point principles. 

A stability criterion with several functions V, based on the theory of 
differential inequalities, is suggested by the author in /17/. In /18/ it is 
generalized by replacing the requirement of stability of the auxiliary system 
by the property of quasi-stability in the following sense. 

Given a system (4.1) and a function 9 (A), which is defined, continuous, 
and nondecreasing in the half-open interval [0,H), where q (0) = 0. 

Definition 7.1. The half-line y=0(¢>0)is called quasi-stable by 
virtue of system (4.1) with respect to y,,....y:({< 4) for the function 4 (A), 
if for any positive numbers A <H and f¢, there exists a positive number 
a (A, t) (@ << A,2<@ (A)) such that for any functions €, (,..., & (Q continuously 
differentiable for ¢ & [0, oo) for which |& (4.)|<a, any solution y (9 of the 
system (1.2) with initial conditions |y (¢,)|<ais defined and satisfies the 
inequality |y,(0|+..... + |y: (|< (A) in any interval [f,, ¢) in which |[§ (|< A. 

This definition generalizes the corresponding concepts introduced by 
Persidskii /26/, E.I. Dykhman /27/, and Zubov /9/, allowing for an 
observation of Erugin /7/ and a definition of Rumyantsev /28/. 

Definition 7.1 is now transformed in the usual manner in order to obtain 
definitions of asymptotic, uniform, and conditional quasi-stability /18/. 

Let us formulate a stability criterion with several functions V with 
respect to some of the variables x,,...,%n. We put H= oo, i.e., here the 
region LF is 


ix] < 0, f>0. 
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Theorem 7.1. Suppose there exist functions V, (x, f),...,Vs (x, with the 
following properties in Ir: 

1) Vilz, >0,...,Ve(x, 1 >0 (Say (7.1) 

Vitdx, OP) +... 4+1Vi (x4, OL >Vo (a... em) (ma), 
where the function V° (x,,..., xm) is positive definite with respect to x,,..., %mi 

2) Valx, OSS, WV, (x, 1). 2 1 Ve (4) 0, Xp 0 Xm *) (SH 1,..., 8)s 

3) each function f, (V,,...,Va, x --..%m,f) is nondecreasing in V,,..., Vs-4, 
Views oo ee Va; 

4) the half-line y = 0 is quasi-stable (resp. asymptotically quasi-stable) 
by virtue of the system dy,/dt = f, (y,,.... 4, (9, ...--. bn (OO, 9 (s=1,..., 2) 
with respect to y,,...,y,for the function @ (A) = inf [V’ (x,,..., xm) for #+...+ 
+ x8 >A'*) and the conditions 4) >0,..... .. Yo > O. 

Then the unperturbed motion x= 0 of the system (1.1) is stable (resp, 
asymptotically stable) with respect to some of the variables x,,..... + Xm- 

If in this case the functions V,,...,V, admit infinitely small upper limits, 


and the quasi-stability stipulated in (4)is uniform with respect to ¢,(resp. 
the asymptotic quasi-stability is uniform with respect to y,, ,,&), then the 
Stability of the unperturbed motion with respect to some of the variables 

is also uniform with respect to ¢,(resp. the asymptotic stability is uniform 
with respect to xp, f,). 

If we have exact equalities V, (x, t) = /, Vi, (x, 0... 2. 0 Ven (4, I), Xp. 6. 0, Se E) (S= 
= 1,...,4) in conditions (2), and the quasi-stability in(4) is unconditional, 
then conditions (3) and (7.1) may be omitted. 

When the functions /, do not depend explicitly on x, the requirement of 
quasi-stability is replaced by the requirement that the auxiliary system be 
Lyapunov stable with respect to some of the variables y,,...,y, for 
Yo > On tie es Yio > 0. 

In particular, by putting k=l!=1, m=n, we obtain the theorems of 
Corduneanu /29/, which in turn generalize the classical theorems of 
Lyapunov and the modifications proved by Persidskii /30, 4/, Ibrashev 
/19/, I.G. Malkin /31/, Krasovskii /24/, Masséra /8/, Zubov /9/, 
Mel'nikov /14/. 

Applications of differential and integral inequalities and fixed-point 
principles to stability problems and problems of the qualitative theory of 
differential equations have been studied by many authors, for example 
{32—37; 39, 41—45/. 

In these studies the norm of the solution (sometimes the generalized 
norm or seminorm) or the norm of the right-hand sides of the differential 
equations of the perturbed motion is restricted directly by the corresponding 
inequalities. This yields interesting results for various specific types of 
systems. 

M.A. Krasnosel'skii and Ya.D. Mamedov /49/, Opial /46/, M.A. 
Krasnosel'skii /40/, La Salle and Lefschetz /23/ use inequalities to 
estimate the function Vor some functional. In Z. B. Tsalyuk's note /51/, where 
the idea of applying the conditions for solvability of Chaplygin's problem to 
the qualitative theory of equations is formulated in a very general form, 
the inequality is satisfied by an operator which maps the set of solutions into 
a certain partially ordered set; see also /40/. 
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The idea of applying Chaplygin's theory of differential inequalities 
directly to generalize Lyapunov's second method while retaining a single 
function V is due to La Salle /23/ and Corduneanu /29/. The most general 
results were obtained by Corduneanu (the core of his arguments are the e 
theorems cited above) and Lakshmikantham /56—58/. 

Theorem 7.1 alsc yields stability criteria which use higher derivatives 
of the function V; sce, e.g., /17/. 

Theorem 7.2, If there exists a positive definite function V (x, 4) whose 
second derivative V (x, 1) < p(t) (x, 4, where p (is continuous and 


exp ae dt Jee < 00 
te 


fe 


(in particular, if p = const <0), then the unperturbed motion is stable. 

As is well-known, the method of Lyapunov functions is being increasingly 
applied to qualitative studies of differential equations /38, 50/. V.V. 
Nemytskii has drawn attention to the problem of global qualitative investi- 
gation of the method of Lyapunov functions and has surveyed the relevant 
literature /53/. In particular, P.N. Papush /54/ and M.B. Kudaev /55/ 
have obtained criteria which sometimes make it possible to visualize the 
type of system of integral curves using the properties of the function V and 
its first, second and third derivatives. In the simplest cases, these 
properties are of the same character as in Chetaev's first instability 
theorem (if one returns to the local problem). 

Theorems 7.1, 7.2, and the like suggest other criteria for the behavior 
of integral trajectories. 

Finally, let us formulate a criterion for the positiveness of the eigen- 
values of the solutions of a linear system with continuous and bounded 
coefficients, * generalizing Chetaev's criterion /2/ (k= }). 

Theorem 7.3. Suppose there exist quadratic forms V, (x, 4,.. ., Vs (x. 4 
with continuous and bounded coefficients, having the following properties 
in Pr: 

1) Viti 9 DO... Vix >O(L<&) and V, (x, +...+Vs (x, 4 is positive 
definite; 

2) Vi (x. < pay Vi (XH... + pee Ve (x, I (S= 1... A): 

3) Pero 0 (Ss #1), Pos <0, Per = const (s, r= 1,..., 2); 

4) all the roots of the equations |p, —A6,,|7 = 0 have negative real parts. 
Then the unperturbed motion is exponentially stable, and all the eigenvalues 
of the solutions are positive. 

In conclusion we note that the prospects for new stability criteria using 
several functions V and higher-order derivatives are apparently far from 
exhausted. We mention several directions for further investigation. 

Stability criteria may be based on the distribution, topologically speaking, 
of the integral curves of the auxiliary system in the space {V,,...,Vs) or 
{V, V,...,V@,} at least for k= 2, 3. 

Different constructive stability criteria may be obtained as special cases 
of the theorems of §§ 4, 7. 

Using various generalizations of inequality fixed-point theorems the given 
criteria may be modified. 


° Theorem 7.3 can be modified to yield an estimate of the elgenvalues of the solutions of a nonlinear system 
(1.1) with the aid of functions V which satisfy estimates characteristic for quadratic forms. 
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Material on the problems considered here can also be found in the books 
/59—63/, and in papers of Krasnosel'skii(/64/ and at the present conference). 
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G.N. Mikishev and B.I. Rabinovich 


SOME PROBLEMS IN DYNAMIC STABILITY ANALYSIS 
OF MECHANICAL SYSTEMS WITH DEFORMABLE 
ELEMENTS 


1. Formulation of the problem 


Consider a solid body containing a cavity partially filled with liquid, in 
a state of equilibrium or unperturbed motion; it is assumed that the field 
of mass forces of this motion has a potential. The body may be either rigid 
or elastic; in the latter case we shall assume that there is some axis of 
geometrical, mass and rigidity symmetry, whose direction coincides with 
that of the field of mass forces of the unperturbed motion. We shall study 
the stability of this motion for a given system of external forces, assuming 
that the generalized coordinates and velocities characterizing the motion 
are small. In general this does not imply the possibility of linearization 
with respect to all the generalized coordinates, since the existence of 
Small nonlinearities (such as backlash, insensitivity zones) is permitted 
for the separate coordinates. The liquid is assumed to be of low viscosity, 
in the sense that the Reynolds number characterizing its perturbed motion 
is large. In other respects, we adopt the usual premises of the theory of 
small wave motions and of the linear theory of elasticity. 

The authors undertook an adequate description of this mechanical system 
with a view to studying both stationary and nonstationary perturbations of 
the motion in a finite time interval, employing theoretical and experimental 
methods, digital and analog electronic computers, and physical models. It 
was therefore necessary to represent the equations of the perturbed motion 
in a definite form. The form most suitable is a system of ordinary 
differential equations of finite order, which is nearly canonical in the sense 
that the nondiagonal terms of the coefficient matrix are smail. Stability 
problems may then be attacked by various approximate methods using 
algorithms easily realized on digital computers. Moreover, such systems 
are also convenient for the study of perturbed motion on analog computers, 
by virtue of the easy communication with the equipment and the proximity 
of the frequencies of the system to the corresponding partial frequencies. 
Finally, by employing equation systems with almost diagonal coefficient 
matrices the generalized coordinates, while their intuitive physical meaning 
is maintained, may be interpreted as quasi-principal coordinates of the 
mechanical system in question. 
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2. General framework for the solution of stability problems 


1. Set up the general partial integro-differential equations and ascertain 
criteria of dynamic similitude. 

2. Select appropriate partial mechanical systems and construct ortho- 
gonal basis systems, using the eigenfunctions of the corresponding boundary- 
value problems, which represent the normal modes of oscillation. 

3. Use Galerkin's method to reduce the original partial equations toa system 
of ordinary differential equations; then reduce the latter to a system of 
order as low as possible. 

4. Determine the coefficients of the systems, in particular, the partial 
frequencies and the added masses. 

3. Investigate the local dynamic stability of the system at discrete 
points. 

6. Investigate the stability of the system in a finite time interval. 

This program is carried out on the basis of theoretical and experimental 
methods, which may sometimes duplicate or complement each other. 

Let us briefly consider the methods used. 

Selection of the partial systems is a fairly complicated problem, since 
inapt selection leads to strong interaction between the systems, and the 
corresponding generalized coordinates are not close to the principal 
coordinates of the system as a whole. 

In particular, it is usually expedient to choose as partial mechanical 
systems a liquid in a motionless cavity with rigid walls, and a solid body 
(rigid or elastic) with a liquid whose free surface is perpendicular to the 
field of mass forces of the unperturbed motion. The first system of basis 
functions is then the set of the normal modes of oscillation of the liquid for 
wave motions at the free surface, and the second system is the set of 
oscillation modes for an elastic body with a quasi-frozen liquid. Both the 
oscillation modes and the corresponding partial frequencies canbe calculated 
theoretically and determined experimentally. At the same time it is useful 
to determine the damping coefficients reduced to the corresponding partial 
frequencies. It should be emphasized that the only adequately reliable 
approach to this problem is the experimental one; this is our approach in 
the present paper, for which certain special methods were required. 

Experimental investigation of the modes and frequencies of the natural 
oscillations of the liquid also makes it possible to determine the limits of 
applicability of the linear theory and to introduce corrections for small 
nonlinear effects. 

Both the reduced damping coefficients of the oscillations of the liquid 
and their experimental values were determined by both theoretical and 
semiempirical methods in the presence of comparatively small dissipative 
effects: influence of the latter on the oscillation frequencies and modes is 
thus negligible and the resulting motion of an ideal liquid may be studied 
independently. 

Once the basis functions are known, there is no essential difficulty in 
setting up the equations by Galerkin's method, but for high-order systems 
this operation becomes very tedious; special computer programs were 
prepared for this purpose. 

The problem of reducing the complete system to one of lower order is 
complicated and does not have a universal solution. It is usually solved on 
the basis of the physical nature of the problem. 
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The coupling coefficients (added masses) were also determined on 
physically similar models, developed for this specific purpose by an 
experimental method. 

The main difficulty in studying the stability of motion of the mechanical 
systems in question over a finite time interval is connected with the high 
order of the corresponding equations of the perturbed motion (up to order 
40 and higher) and with the need, in a number of cases, for complete 
information on the character of the perturbed motion, including the 
frequencies and amplitudes of auto-oscillations. The usual stability methods 
are not effective for the solution of these problems even if modern high- 
power computers are used, and in a number of cases they are simply 
unsuitable. Therefore special methods had to be worked out for studying 
both local stability at discrete points and stability over a finite time interval. 

All these methods are based on the specific structure assigned to the 
equations of the perturbed motion (small nondiagonal elements in the 
coefficient matrix) and the physical features of the system in question (small 
dissipative forces and harmonic linearization of the nonlinear sections). 

As a result, we have developed methods which permit the effective 
solution of stability problems on digital and analog computers. A brief 
description of these methods will be given below. 

We particularly wish to emphasize the possibilities of solving stability 
problems in complex mechanical systems with deformable elements by the 
use of analog devices, in which the mechanical system is represented by 
a physically similar model and the known external forces by special force 
generators. The work program of the latter is given by an analog computer 
whose input consists of instructions from the position and velocity sensors 
of the elements of the physical model. Using this method one can, in 
principle, take into account all the features of the actual mechanical system, 
including the most subtle dissipative and nonlinear effects in the most 
complete manner. 

This method was extensively employed in carrying out the present work. 
The body-liquid system was represented by a physically similar model, 
and the external forces were determined by the analog part of the complex. 


§ 3. Some results of investigations into the dynamics 
of liquid-filled rigid bodies 


We present some results concerning the frequencies of the natural 
oscillations, the damping coefficients and added masses, which characterize 
the perturbed motion of a liquid partially filling moving cavities of different 
configurations with absolutely rigid walls. 

We obtained theoretical solutions by separation of variables, variational 
methods and methods of long-wave theory (in cases of small relative depth). 

In so doing, our main interest was in reliable determination of the added 
masses, with preassigned accuracy. For this purpose systems of coordinate 
functions (usually harmonic functions satisfying the maximum possible 
number of boundary conditions) were specially chosen, with regard to the 
specific form of the cavity. 

For a check, the solution was usually duplicated, using an auxiliary 
system of basis functions. The convergence and stability of the machine 
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computations were specially studied, and approximate solutions used for 
checking purposes. The frequencies and damping coefficients were 
determined experimentally by the method of free and forced oscillations. 
The added masses were determined experimentally by a specially developed 
technique involving the partial frequencies, the frequencies of the system 
and the measured amplitudes of the oscillations of the liquid. 

As an example we cite theoretical and experimental results for cylindrical 
cavities with radial and coaxial partitions, spherical, conical, and toroidal 
cavities. There is complete agreement between the experimental and 
theoretical results, obtained for cylinders by the method of separation of 
variables, and for other cavities by the variational method. There is also 
good agreement with results obtained for some special cases by other 
authors. 


4. Some results of investigations into the dynamics of 
liquid-filled elastic bodies 


Given below are the results of theoretical and experimental investigations 
of longitudinal and transverse oscillations of thin-walled elastic cylinders 
and spherical thin-walled shells filled with liquid. 

The theoretical results were obtained by rigorous solution of the hydro- 
dynamical problems by separation of variables and in some cases by 
reducing the boundary-value problem to an integral equation, followed by 
calculation of the modes and frequencies of the shell oscillations by the 
Bubnov-Galerkin method. 

The experimental results were obtained by studying physically similar 
models. Our main interest was in the oscillation frequencies, and we 
carried out a qualitative investigation of the oscillation modes for different 
numbers of nodes along the periphery. 

Comparison of the theoretical and experimental results shows satisfactory 
agreement, except for cases of a large number of half-waves along the 
periphery (6—7), which display appreciable discrepancies requiring 
additional analysis. 

For specific shells results are given, as examples, for cases of full, 
half-full and dry shells. The agreement between theory and experiment 
is quite satisfactory in all these cases. 


é 


5. Investigation of the stability of unperturbed motion 
over a finite time interval 


We present some results of investigations of the perturbed motion of 
mechanical systems in question over a finite time interval, performed on 
a digital computer. 

In the first stage we consider local stability by frequency and algebraic 
methods, based on investigation of the complex eigenvalue spectrum. In 
solving these problems the specific structural features mentioned above 
were used for the high-order equations of the perturbed motion. In a number 
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of cases this yields simple geometrical forms for the elements of the gain- 
phase plot of an open-loop system, and simple approximate formulas for 
the roots of the characteristic equation of an open- and closed-loop system, 
based on an iteration method. 

In the next stage we solve the problem of complete integration of the 
system of equations of the perturbed motion over a finite time interval ona 
digital computer, taking the principal nonlinearities into account. 

To this end we effect a real transformation of the coefficient matrix 
for even derivatives, which leads to block diagonalization. In so doing the 
terms of the equations involving odd derivatives and nonlinear terms are 
assumed to be multiplied by a small parameter. 

All these terms are transferred to the right-hand side and transformed 
appropriately in passing to the new generalized coordinates. 

Essentially, we are realizing here A.I. Lur'e's idea of canonical 
transformations generalized to a special case of equations with slowly 
varying coefficients. Employing this method on a digital computer, we can 
carry out, with reasonable accuracy, diagonalization of a system of any 
order up to 40, actually performing this operation for a system of half the 
order; the corresponding transformations are real and cope with the case 
of real and complex roots, including also multiple ones. 

The final stage is to apply the method of harmonic balance to each of the 
resulting independent subsystems with nonlinear terms of order e (degener- 
ating in certain cases to second-order equations). 

The equations for the envelopes are integrated on a digital computer by 
a numerical method, allowing for variable coefficients in the general case. 

To summarize, we can now envisage the most complete investigation 
of the dynamical properties of the mechanical systems in question over 
a finite time interval. 
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EQUATIONS OF MOTION AND THE ROLE OF SMALL 
PARAMETERS IN THE DYNAMICS OF NONHOLONOMIC 
SYSTEMS 


I. Equations of motion of nonholonomic systems 


After Newton's discovery of the laws of mechanics, no essential 
difficulties were involved in setting up the equations of motion for a system 
of material points. However, the problem of eliminating the unknown 
reactions of the contraints then arose. For holonomic systems with ideal 
constraints, the problem was solved by Lagrange. For nonholonomic 
systems it was first solved by S.A. Chaplygin /1/. In 1895 Chaplygin 
proposed equations which seemed fundamental for analytical mechanics of 
nonholonomic systems: 


m a i) (1) 
dat) ar) , wv ar By) BS) 
dt qs 0q, - 04; + fey oq, 2. ( 04, aa, ) & > . (1) 
where T=T (gmsr.- +++ 9a» U1» +++) Im» 9me1,++ +1 Qn) iS the kinetic energy of the 
system, V =V (qmu,---» 9a) iS the potential energy. BW are the coefficients 
in the relations 
- oy ; i= 1, 2y20.5M ) 2 
qi. = By Cone Pye 7s (Paneer a aca (2) 


which are the transformed equations of the nonholonomic constraints 


AY (qmers +++ 4a) +...+ A (marys +++ 9a) Gn = O. (3.3) 


The function (7) = T (qms1.-- +> Qn» Be ga,» » «1 Bx” Gus Qmeors © > In): 

Although equations (1) are valid for a limited class of nonholonomic 
systems (the so-called Chaplygin systems), they actually describe almost 
all known nonholonomic systems. 

In 1901 P. V. Voronets /2/ obtained the equations of motion of a non- 
holonomic system for the more general case in which T, V and Bf’ may 
depend on all the generalized coordinates as well as on time. For Chaplygin 
systems Voronets' equations coincide with (1). In 1898 V. Volterra /3/ 
proposed equations of motion in kinematic characteristics, later to be called 
quasi-velocities. It should be noted that quasi-coordinates were actually 
used (though without proper justification) by Chaplygin /1/ in solving the 
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problem of plane nonholonomic motion. In /4/ one of the authors derived 
conditions for the validity of Chaplygin's equations in the case in which not 
all the generalized coordinates are true, generalized them to the case of 
quasi-coordinates. Volterra's equations are nevertheless correct, despite 
the fact that his methods were wrong. In 1901 G. Maggi /6/ derived the 
equations of motion in a form which yields Volterra's equations. On the 
basis of Maggi's equations, one can introduce an acceleration function 


N 
S= +m m, (x3 + y3 + 2), express it in terms of generalized coordinates, and 
i= 


oS 

a 
himself derived in 1899 on the basis of the principle of least constraint /7/. 
L. Boltzmann /8/ (1902) and G. Hamel} /9/ (1904) established a form of 
equations in quasi-coordinates which coincides for holonomic systems, 
referred to moving coordinates axes, and for nonholonomic systems. Let 
the quasi-velocities be related to the generalized velocities by (in general) 
nonintegrable relations @, = ay, (q,, -» +» Qa) 9a; Ga = Bar (Qu «> +» Gn) Or; (@rsBsx = 8,,). 
Then the Boltzmann-Hamel equations have the form 


obtain P. Appell's equations = Q,;(s= 1,2,...,a —m), which Appell 


$ (2) 4 Bre - =, (=1, 2,...,A), (4) 
where 
Tru = 2 2 Bu Bus ( a = a) ; 
T° = T (97, Brr@r) 
and or bs 2 By . For nonholonomic systems m quasi-coordinates are 
=1 


on, 


functions whose time derivatives are equal to the left-hand sides of the 
equations of the nonholonomic constraints (3). In this case (4) contains 
only (an — m) equations. 

Later versions of the equations were developed by J. Tzenoff /10/ (1924), 
G. Vranceanu /11/ (1926), J. Schouten /12/ (1929), and others using the 
equations of the nonholonomic constraints in different ways. If the initial 
d'Alembert-Lagrange equation is transformed as 


d / dT oT 7 d 
“ar (84) — OT + (84 — edn) = Qeae @=1, 2,...,). 


then everything depends on the specific commutation relations used (i.e., 
the value of the expression (84.= 7 84)). In the language of geometry this 


means that the form of the equations of motion is determined by which 
curves in the configuration space are chosen as comparison curves for the 
trajectory of the real motion. 

As shown in /13/, the problem reduces to some method of extending the 
definition of the operators d and 6 outside the curve of motion (preserving 
their usual meaning on the curve itself). According to one method, the 
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operators d6é and 6d commute for all the generalized coordinates: the 
comparison curves are then kinematically impossible trajectories, and we 
obtain the Boltzmann-Hamel equations in quasi-coordinates. By another 
method, the operators d5 and 6d commute only for independent generalized 


coordinates, and for dependent coordinates the expressions $4, — 5 841 


depend on the equations of nonholonomic constraints. The comparison 
curves are then kinematically possible trajectories, and we qbtain the 
equations of Voronets, Chaplygin, Maggi, and others, in true coordinates. 
This general approach yields another (m — 1) cases /14/ in which the equations 
of motion contain both quasi-coordinates and true coordinates: these cases 
are intermediate in form between the Boltzmann-Hamel and Voronets 
equations. 

As early as the beginning of this century, it was remarked /15/ that the 
equations of motion of electromechanical systems containing distributed 
conductors with sliding contacts do not have the form of the Lagrange 
equations of the second kind. It was assumed that such systems are a 
variety of nonholonomic systems. In 1952 A.V. Gaponov showed /16/ that 
attaching a sliding contact to a conductor distributed either in a volume 
or over a surface is equivalent to imposing a nonholonomic constraint on 
the current distribution. Gaponov proposed new equations of motion for 
Chaplygin systems: 


an 
d ole dL* i oLe® - 
es eee ees pile) ee r=m+i1,...,4), (5) 
a 09, 09, ¢ -s u,, rs Q, ( + ® ) 


where the Lagrangian L’ is obtained from L by using the nonholonomic 


constraints (2), replacing the coordinates gq, in the coefficients BY? (gma. - +» Ga) 
by independent parameters u,, so that Bf” = BY? (us, mets Urmene oe »Uen). Thus 
L* =L? (mars -- +s Yne Ymoav~ +++ Gane Umer, meas Umer, meas. sees » Uan). . After carrying out 


the operations indicated in equations (5) one must put u,=—4q. It is advisable 
to use Gaponov's equations when the number of generalized coordinates is 
very large and the number (a—m) of degrees of freedom small. This is 
precisely the situation in the general dynamics of electrical machines. 

In a discrete description of an electrical machine with drum winding, 
the electrical state is characterized by a countable set of generalized 
coordinates. The commutator is a device which makes it possible to fix 
the current distribution in the revolving rotor in the coordinate system 
relative to the brushes. Thereby a countable set of nonholonomic 
constraints is imposed on the electrical coordinates, and as a result the 
system has a finite, usually small, number of degrees of freedom. The 
division of electrical machines into those with or without commutators 
corresponds to their division into holonomic and nonholonomic systems: 
commutator machines are Chaplygin systems. 


II. The role of small parameters in the dynamics of 
nonholonomic systems 


In this section we study the influence of small parameters on the dynamics 
of nonholonomic systems in a number of concrete examples. This helps to 
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clarify the problem of the idealization involved in the neglect of small 
physical parameters when constructing a mathematical model of a non- 
holonomic system. As is well-known, an indication of the admissibility of 
some specific idealization can be inferred not only from a comparison of 
theoretical results with experimental data, but also from a comparison of 
the results of two different theories, only one of which utilizes the given 
idealization. The examples we present are quite varied, but they are all 
related by a common approach based on the division of the phase space into 
subspaces of slow and rapid motions. When the slow motions are stable 
relative to the rapid motions, the small 
physical parameters in question may 
safely be neglected. In the opposite 
case, the role of the small parameter 
is important. 

As a first example we consider a 
system with a nonlinear nonholonomic 
constraint. 

The derivation of the equations of 
motion of mechanical systems with 
nonlinear nonholonomic constraints has 
been studied extensively /17—22/. At 
the time, Appell /18/, P. Delassus /23/, 
H. Beghin /24/, and others hotly debated 
whether it was possible to realize non- 
linear nonholonomic constraints: 

s however, these studies essentially 

contain no examples of systems with 

Se nonlinear ideal nonholonomic constraints 
differing significantly from the example given by Appell /18/ in 1911. This 
example was considered later by Hamel /20/, who set up its equations of 
motion by two methods. First, on the basis of the general equations of 
motion for systems with nonlinear constraints, Hamel obtained equations 
of the following form for the motion of a system by inertia 


xy—xy =0, (a°+5%)z = — gd? (6) 
and the equation of a nonlinear nonholonomic constraint 
. . a® - 
2 + y" = = 2". (7) 


Introducing the angles 6 and 9 by x = a@ cos 6; gy = ag sin 8; 2 = dp, (6) then 
becomes the system 


6= 0, (a* + b*)@ = — gb, (8) 


and equation (7) becomes an identity. Hamel then arrived at the same 
equations (8) by letting (p — 0) in the system 


p (Pp) + aby) = 0, (a+ 6°) — apd = — gb, (9) 
which describes the motion of a nonholonomic system (Figure 1) with 


ordinary linear constraints. The agreement between the two systems of 
equations convinced Hamel of their correctness. However, since in passing 
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to the limit the order of the system of differential equations (9) is reduced, 
i.e., degeneration occurs, it is not a priori clear whether the limit motions 
of the nondegenerate system coincide with those of the generate system. 
The present authors showed in 1963 /25/ that these motions are different. 
Thus, a more rigorous approach to the study of the system in the 
Appell-Hamel example leads to motions which cannot be described by 
Hamel's equations. * 
In fact, let us introduce new variables £ and yn defined by 6 =ak; @ = Bn, 
where a= ) 1 + 6*/a*, B = gb/(a? + 6%). Denoting the physical parameter of 
the system by » = |p//aB > 0, we write equations (9) in the form 


pE=FEn, n= —1 +p? (10) 


where the upper sign refers to the case p> 0, and the lower to the case 
p<0. The motion of the system under discussion can be compared toa 
displacement of the representative point in the (£. n) plane. To delineate 
the trajectories in the (, n) plane, we divide the second equation of (10) by 
the first and after integrating the resulting equation (in which the variables 
are separable) we obtain a family of integral curves 


ny = p (+ In §? — pE*) + const, (11) 


shown in Figure 2. Consider the case p>0. The motion of the represen- 
tative point in the (&, n) plane is then described by the equations 


nE=—tn, n= —1 + pe. (12) 


Expressing the arbitrary constant in equation (11) in terms of the initial 
values §= & n=, we obtain 


n= 7 (8), 1) =e [In F-— 8) ] +b. (13) 
0 
1) 
| 
F g 
a b 
2 


* A similar situation obtains for the friction regulator in V.S. Novoselov’s example. 
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Using (13), equations (12) give the function § = € (Qin the form 


E 
dt 
*) evrey. o 


where the upper sign corresponds to the motion of the representative point 
along the upper half of the closed integral curve (Figure 2a) through the 
point (£), n.), and the lower sign to its motion along the lower half. The 
angle 6 of the plane of revolution of the wheel and the angle 9 of its own 
revolution are, in terms of the variable &, 


3 
d 
0-6, = Faal rar) © = Fein E. (15) 


The absolute value of the velocity vg of the point B of contact between the 
wheel and the plane (Figure 1) is 


Os = a|9| = aB Vf (8). 


Hence it follows that when £ assumes the values £ = §, and §=£,, where &, 
and §, are the roots of the equation f(&) =p [In (E/E — p (&? — &3)1 + 42 = 0 (the 
condition for intersection of the integral curve with the axis n= 0), the 
point B halts momentarily. Since we then have 
6-0 and @ changes sign, the trajectory of the 
point B forms a''beak-shaped' curve with a 
cusp. Between these singular points the 
trajectory is a segment of a spiral curve. Thus, 
in one revolution of the representative point 
around a closed curve of Figure 2a, the point 
B describes a curve which we call acell. One 
of these cells is marked M —N —M in Figure 3; 
the point M corresponds to £ = §, and the point 
Nto$=&,. A complete trajectory consists of 
identical M— N —M cells along an annular strip, 
since each cell is symmetric with respect to 
the line D,D, (Figure 3). Depending on the 
initial conditions and the relations between the parameters of the system, 
the trajectory of the point B is either a closed curve or a quasiperiodic 
curve everywhere dense in the annular region. 

When p —0 the whole (§,) plane becomes a region of rapid motions 
(with respect to the coordinate &), with the exception of the line § = 0 which 
is the axis of slow motions. The half-line (€ = 0,» > 0) is then stable, and 
the half-line (£ = 0,» < 0) unstable relative to rapid motions. Let us find 
the limit motion of the system when » -0. To this end we consider the 
motion of the representative point in the (€, n) plane along one of the closed 
integral curves. The limit position (for p —-0) of the integral curve is 
shown in Figure 4. In fact, it follows from equation (13) that when » —0 
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degenerates into a pair of straight lines n = +. The roots £, and &, then 
tend to zero and to infinity, respectively, according to 


[bile [Ela M. 


Using these expressions in conjunction with (13)—(15), it is easy to 
calculate the variations of the time ¢ and the angles 6 and @ for the passage 
of the representative point through the segments OD,DE,..., HO in Figure 4. 
As a result we arrive at the following motion of the system for p — 0, (p > 0): 
during time ¢, = n. the wheel rotates through an angle 9, = — fn?/2, moving 
along a straight line 6, = const, andthen during time ¢, = »,/2 rotates through 
the same angle 9, = 9, = — fn?/2, and its plane turns through an angle 6, =a). 
The trajectory of the point B in this section has the form of an arc whose 
length is equal to that of the previous rectilinear segment. The plane of the 
wheel then turns instantaneously through an angle 6, + 6, = 2a, and the 
direction of the intrinsic rotation of the wheel is reversed. The trajectory 
of the point B then forms a corner point with a finite angle 6 = 2a. Then the 
wheel again describes an arc-like trajectory, rotating through an angle 
%. = Bn’/2, during time ‘, = n,/2 and its plane turns through an angle, 6§,=«¢ 
Finally, during time ¢ =m, the wheel moves along a Straight line, rotating 
through an angle 9, = 6n72. At the end of this segment of the trajectory, 
the direction of the wheel's intrinsic rotation is reversed, and the direction 
of the motion changes. The complete trajectory of the point B has the form 
of a rosette, either closed or an open curve everywhere dense in some 
annular region. The dimensions of this region are proportional to the 
initial value |n,| of the intrinsic angular velocity of the wheel. 

It is easy to see that this limit motion differs from the motion of the 
system (p= 0). In fact, in the limit case, for p-—0, the equations of motion 
(10) become §=0; y= —1. 

Hence it follows that for any initial conditions, the motion of the system 
corresponds to a displacement of the representative point in the (E, n) plane 
along a straight line § = 0 at a constant velocity 7 = —1. This means that 
the trajectory of the contact point 
of the wheel with the plane is always 
a straight line along which the wheel 
rolls with constant acceleration. 

The case p< 0Ois similar: the 
trajectory of the point B coincides 
qualitatively with that of the Chaplygin— 
Carathéodory sledge. For » —0 the 
form of the trajectory is the same, 
except that the cusp becomes a corner 
point. 

4 As another example clarifying the 
role of the parameters neglected in 
the idealization of a real physical 

system, we mention the influence of the usually neglected area of contact 
of a sphere with a plane on the dynamics of a sphere rolling along a rough 
plane. Consider the inertial motion of a homogeneous heavy sphere over 
a horizontal rough plane and take into account that the contact surface of 
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the ball with the plane is a small circle of radius a. The case a=0 
corresponds to point contact, which in the case of rolling without slipping 
leads to a nonholonomic constraint. The limit 
system for a—0 is a sphere with point contact. 
Here too it is not clear a priori whether the 
limit motion of the system for vanishingly 
small a (a+ 0) coincides with the motion of 
the limit system (=0). A study of the 
dynamics of the sphere shows that they do 
coincide and, consequently, the idealization 
I involved in replacing the surface of contact 
by a point is justified. 
Finally, we consider an example in which 
the theory of differential equations in which the 
leading coefficient isa small parameter which contributes to the solution of a 
certain problem concerning the realization of a nonholonomic constraint. 
In 1933 Carathéodory /27/ studied the dynamics of a Chaplygin sledge, in 
the special case when the center of mass of the sledge is situated ona 
Straight line in the plane of the blade, The equations of motion of the sledge 
are 


u=aw’, ak*» = — uo, (16) 


where u is the velocity of the point of contact of the blade with the plane, 

@ is the angular velocity of rotation of the sledge, k= 1+ J/ma* (Figure 5). 
The reaction force R of the nonholonomic constraint prevents the sledge 

from slipping in a direction perpendicular to the plane of the blade, and 

therefore the velocity component in this direction is v=0. Assuming R 

to have the character of viscous friction, Carathéodory represented this 

force in the form 


R= —WNo, 


where N—the coefficient of viscous friction—is very large. For v+0 the 
equations of motion of the sledge are 


vu=eawo; &=ae*-+eawo; ak*d + uw =—eao, (17) 


where e= J/Na* is a small parameter. For e--0, equations (17) coincide 
with (16). However, if we consider the motion for the initial conditions 
Up = Up = 0, w = kc, where c= const +0, equation (16) gives initial values 
®) = 0, W, = —&c*, and equations (17) the values o,=0, o,=0. Hence 
Carathéodory concluded that the nonholonomic constraint in question cannot 
be realized by means of viscous friction forces. 

A more careful investigation disproves Carathéodory's conclusion. In 
fact, let us write equations (17) in the form: 2 


O=6; g=aw*+poc: ps = —ak*s— uo, (18) 


where v=yo;p = ae is a small parameter. The third equation of (18) implies 
that for » —0 the phase space (u, o,0) becomes a region of rapid motions, 
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with the exception of the surface 
ak*s + uw = 0, (19) 


which is a region of slow motions stable relative to the rapid motions. 

In the case of small values of the parameter yn, (u» + 0), the representative 
point in the (u,, 0) space reaches the p-neighborhood of the surface (19) from 
points outside a finite neighborhood of this surface in time of order 


p In , and remains there /28/. Now let us study the behavior of the 


function o =o (f). To this end we consider the motion of the representative 
point in the (u,,0) space. Differentiating the third equation of (18) and 
eliminating the variable o, we obtain from (18) a system of differential 
equations 


ak? S = a*k* wy? — pw (uw + pS), 
ak? © = — uw — ps, (20) 


pak* = = [p(u + pw") — a®k*}a — aw? + wu? + puw®, 


which describe the motion of the representative point in the (u, 0, o) space. 

From the third equation of (20) it follows that when p» —0 the (u, w, c) space 
becomes a region of rapid motions (with respect to the coordinate a), with 
the exception of the surface 


aks = wut — a®r*o!®, (21) 


which is a region of slow motions (Figure 6). These slow motions are 
stable relative to the rapid motions. For small values of the parameter 
p. (u #0), the region of slow motions is the p-neighborhood of the surface 


[p (u + po*) — atk*) 6 — a*k*w? + wu® + puw® — 0. (22) 


In a bounded region of the (u, o, 6) space the surfaces (22) and (21) differ by 
a small quantity of order p, and when p —0, they coincide. 
In the case of Carathéodory's initial conditions (u, = 0, 0, = &c, 6, = 0), the 
representative point in the (u,,s6) space is situated outside a finite 
neighborhood of the surface (22) and, by what has 
been said above, reaches the p-neighborhood of 


the surface in time of order p In 7. When »p-0 


this time interval tends to zero. For values 
u=0 and w= & on the surface (21), we have 

o = —kc*, and therefore when p — 0 the change 

of 6 from 6=0to c= —ke becomes instantaneous 
(see Figure 6). Thus, the contradiction which 
Carathéodory discovered is resolved as follows: 
although the values =o are indeed different at 
the initial moment { = 0 according to whether 

6 p=0 or p +0, (ph < 1), this difference disappears 
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in a time interval of order p In . Consequently, as p —0 the limit motion 


of the nondegenerate system (18) differs by an arbitrarily small amount 
from the motion of the degenerate system (p = 0)/16/. 

The above analysis implies that the nonholonomic constraint involved in 
the motion of a blade over a plane can be realized by means of viscous 
friction forces; it corresponds to the case of an infinite coefficient of 
viscous friction. 
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ANALYTICAL DYNAMICS AND OPTIMUM TRANSFER 


§ 1. Stationarity conditions 
Suppose the equations of motion of an object are 
X= Fy (x, us, t) (i, f= 1,2,.., R= 1, 2, 07), (1.1) 


where the u (Q are piecewise-continuous controls (control functions) and ¢ 
is the time. 
The controls must satisfy the conditions 


Un, & Ue Kes, Uy =const, usy = const. (1.2) 


In particular, for all or some controls we may have ug, = — oo, us, = oo, 
We shall assume that every admissible control corresponds to a single 
trajectory of system (1.1) issuing from a point with coordinates xi at the 
initial instant ¢,;. The final igstant will be denoted by ¢;, the coordinates 
of the final point by xf. 
The boundary values of the variables are related in general by implicit 
conditions 


Ry (xi!, xi', ty, ts) =0, (p=1,2,...,05 2n + 2). (1.3) 


A trajectory satisfying conditions (1.1)—(1.3) will be called admissible. 
The controls must be chosen so that the functional 


‘f 
Fo (Xs, us, thdi+R, (x;!, af, th, te), (1 -4) 
i 


be a minimum relative to its values for other admissible trajectories. 
The functional (1.4) on admissible trajectories is equivalent to the 
following functional: 
if 
U = \ Lat+R, (1.4') 


where i 


ben], 


a g 
L=Fot Di M(e— Fi), R=Ret+ D vRp 
om} 
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Here the 4, are explicit functions of time, to be determined, and the v, are 
undetermined constants. 
The controls u, may be replaced by new controls y, using formulas 
U, = Us (yx), where the derivatives du;/dy, vanish at the boundaries of the 
region (1.2). Examples of this procedure are discussed in /1, 2/. 
Alternatively, we need not express u, in terms of y,, but consider an 
increased number of controls and introduce additional undetermined 
multipliers 4,4, which are functions of time. We must then add the sum 


> Ante [a — Ur (ya)) to the function L. 


hwi 


The total variation of the criterion functional is given by the formula 


tt t 


{ +7 a a 
4 \ Lat = ) PE re— > | hat Se) Oxi] dt+ (D ddx— Hate. (1.5) 


ium} 


Here we denote 


H => Me —L = — Fo + DMF. (1.6) 


fel tes) 


Let the functional (1.4) have a smooth minimum. By equating AU to zero, 
we then obtain necessary conditions for an extremum 


kj = —@OH/dx,, (i=1,2,...,A), (1.7) 
0H [0v, = 9, (R= 1,2,...,7)5 (1.8) 
AR + (3) Au — Has)?! =0. (1.9) 


{=} 


If the total variations Ax, in (1.9) are expressed by the formulas 
Ax, = 5x; + x; At and the terms in Af and 6x, equated to zero, then instead of 
(1.9) we obtain 2n+ 2 transversality conditions in Bolza's expanded form /13/. 
We mention the frequently encountered time optimal control problem, 
when the criterion functional may not have a smooth minimum. In this case 


tf 
Fe=1, Ro=0, A \ Ldt = At;- At,, 
{ 


and formula (1.5) assumes the form 


(153 ay, Oe S (i+ 3 3 an]at + [35 der + A) a] =0. 


Sj ait {amt 
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Hence we obtain H;, = He, = —1, equations (1.7) and (1.8) and the 
transversality condition 


a b 
AR+ (3) udu) '=0, R=D wR, 
owt 


t 
fw} tf 


In the case of an increased number of controls, equations (1.8) are 
replaced by 
du, 
a 


Sag — basa = 0, Nash (1.8') 


Suppose the controls y, are not introduced, and the investigation involves 
only the initial controls u,. Equations (1.8) must then be replaced by the 
conditions 


OH/Ou,z=O0 for Up, < Ua< Ung, ‘ 
oH eH (1.8") 
a 0 for Ug = Up), au, >90 for Un = Uy. 
uy lbp 


Remark 1. The system (1.1) and (1.7) is a canonical system with 
Hamiltonian H, momenta 4, and coordinates x,. This canonical system 
differs from a mechanical canonical system in the linear dependence of H 
on the A, and the occurrence of additional variables u, or yy,. 

Since system (1.1) and (1.7) is canonical and the limit controls us, and 
Us, are constant, on any section of an optimal trajectory with continuous 
controls we have 


é a 
H=h4\(— 3+ 3 mG) ae (1.10) 


eo =) 


where hk = const, ¢, is the initial instant of the section in question. 

If the functions F, and F; do not depend explicitly on ¢, H is constant on 
any section of this type. 

Remark 2. A point of discontinuity of the controls is called a corner 
point. Let us vary the position and time of some corner point by quantities 
Ax,and At; using formula (1.5) for an optimal trajectory on both sides of the 
corner point, the variation of the criterion functional U is 


(3 diAx,;— HAt) — (3 AAx:—HAt) 


ay t-0 


where 
Ax |e-o = Ax; |<40 = Ax;, At |e~0 = At Ie+0 = At. 


For an optimal trajectory the variation of the functional must vanish. 
Hence we obtain the well-known Weierstrass— Erdmann conditions for the 
continuity of 4, and H along the whole optimal trajectory. 


~~ 
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In the case of stationary functions F, and F;, therefore, H is the same 
constant for any optimal trajectory. For example, for time optimal control 
it follows from the previous considerations that H=—1. 


§ 2. Maximum principle 


Integration of the variation of the criterion functional along the trajectory 
tubes is a method often used in analytical dynamics /4/. We shall use it 
to obtain necessary and sufficient extremum conditions. 

Consider any admissible trajectory AB of system (1.1) and (1.7) with 
controls {u). This trajectory passes through any two points A and B which 
satisfy equations (1.3) at some instants ¢, and {;. In particular, the points 
A and B and the instants f; and f; may coincide with the initial and final 
positions and instants of the optimal trajectory, and the trajectory AB may 
differ from the optimal trajectory only in some intermediate interval in 
which the controls {u,} differ from the controls {u,} of the optimal trajectory. 

In the 2n + l-dimensional configuration space of the coordinates x,, the 
momenta 4, and the time ¢, we pass a curve C*, through the point A and the 
initial point of the optimal trajectory: 


x} = xf (ap), M=Ai (ap), t! = tap, 01), ap = ap(H) (P= 1,2,...,2n). (2.1) 


We note that for the time being formulas (2.1) may be arbitrary if the 
condition df/a0 >0 is satisfied and the contour C. passes through the two 
indicated points. Additional restrictions on the choice of C,; will be given 
below. We shall call any trajectory of system (1.1) and (1.7)anextremal. 
Through a point of the curve AB and the section of the curve C,; between the 
initial point of the optimal curve and the point A we now pass extremals 
which correspond to controls which coincide with {u,}on the optimal 
trajectory. 

Suppose the relations (2.1) are such that for v= of the curve C? is 
transformed by the indicated extremals into a curve C, through the 
terminal point of the optimal trajectory and the point B. The control C, 
is determined by the general formulas 


xem ¥1(x1! (ay), a(t), £), 
Dam Ay (Ma! (ety), 24! (tp), ten (6), £), (2.2) 
b= f(ay, off), ay = ay (p). 
Through the points of the section of the curve C; between the initial point 
of the optimal trajectory and the paint A, we pass extremals with controls 
{u,), such that these extremals, for o=0'f, completely fill the section of 


the curve C, between the final point of the optimal trajectory and the point B, 
subject to the following conditions: 


Us (¢ (a, (p), v'i)) = Uy (t (a, (ps), v'1)), 
in (t (tp (12), O°f)) = tte (t (ay (pH), Of). 
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At any instant ¢, {us} and {u,) are boundary values for the controls {u,}. 
The last condition on the choice of the contour Cp and the controls {us} is: 


n ap. off 
AR+(S wdc — Hat)”, =O. (2.3) 
fet 


flay, ot) 


All the above conditions can be satisfied, by virtue of the arbitrary 
dependence of x/i and 4/i on a,, the functions a, on the parameter p, and uy 
on the time ¢, and the fact that the function ¢(a,, v) is also arbitrary. 

Let us integrate expressions (1.5) in the above field of extremals, whose 
controls coincide with the optimal controls {us}: 


T 


tf fon 
_ Ldt = — S) MAxi— HAt + 
i coi t; lend, 
Ts; a ry a 
Cule rc | ABt; jen} 


The integral on the left-hand side is calculated along the optimal 
trajectory, the first integral on the right-hand side along the curve Cé 
from the initial point of the optimal trajectory to the point A, the second 
integral along the curve C, from the final point of the optimal trajectory 
to the point 8, and the third integral along the trajectory AB. 

The quantities n, are determined by formulas (1.1) in the form 
n: = F; (x;, Ua, ). The function H and the factors 4, in the third integral on 
the right-hand side are calculated for the extremals of the field at their 
points of intersection with the curve AB. The controls occurring explicitly 
in H therefore coincide with the optimal controls {u). 

Integrating formula (2.3) in the field of the extremals with the controls 
{u,), we obtain 


Tr on Ty on 
( 3} uaa —Hat— SD Ade, — Hat = 
Culs len Chey sam} 
ti te ~T; ~Te ~~ 
w= Ro (x; 9 Xi ty, ty) — Re( x » X% » fy, tf). (2.5) 


By virtue of (1.6), which defines the function H, we find 


Dy Ame me H (xt, Dey te, f) +L (Xt Ae, Un, 8). (2.6) 


{ony 


Using formulas (1.4'), (2.4)—(2.6) and the fact that the optimal trajectory 
and the trajectory AB satisfy conditions (1.3), we have 


Tf 
Uaa— Vopr = [—H (Xt, des Uns t)  H (x2, At, ta, t)} dt > 0. (2.7) 
Any 
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Hence we obtain a necessary condition for the minimality of the criterion 
functional—that H be maximum on the optimal trajectory: 


H (x1, bt, Un, 1) > H (x1, Aes Ue, 0). (2.8) 


Condition (2.8) is the basic condition of L.S. Pontryagin's maximum 
principle /5/ and is equivalent to the Weierstrass condition /3/. From 
(2.8), in particular, follow conditions (1.8), (1.8') and (1.8''). 

Suppose condition (2.8) is satisfied for any controls {u,), certain controls 
{u,}, and x; and 4, solving equations (1.1) and (1.7) for any xji and Afi and for 
the controls {u.}. Then if conditions (1.1)—(1.3), (1.7) and (1.9) are 
satisfied on some trajectory with controls {u,}, then, as shown by the above 
considerations, the trajectory in question is optimal. 


§ 3. Application of the Hamilton-Jacobi method 


The optimal trajectory {x,} and the undetermined factors {4,} satisfy 
esanonical equations with Hamiltonian (1.6) and controls {u,}, considered as 
explicit functions of time. This canonical system can be solved by the 
Hamilton-Jacobi method. 

To this end, consider the partial differential equation 


B tH (sn x un (0), #) = 0. (3.1) 


By virtue of formula (1.6), equation (3.1) becomes 


av , maw 
art 2 ay, Fe= Fo. (3.1") 


Formula (3.1') shows that the Hamilton-Jacobi equation for the optimal 
control problem is a first-order linear partial differential equation. 

Suppose that for certain controls uw, (4 the complete integral of equation 
(3.1) has been found: 


W = W (x;, a, t) + do (3.2) 
where a, are n independent constants and a, is an additive constant. The 


solution of equations (1.1) and (1.7) for these controls uy, ( is then given 
by the formulas 


ov 
da; — (3.3) 
ow 
dz, — Me (3.4) 


where 6, are new independent constants. 
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It is well-known that the complete integral of equation (3.1) is determined 
up to an additive constant by the action integral. Therefore 


t 


Vv = Ldt + a. 
{ 


The function R in formula (1.4) depends on the initial and final values of 
the variables and is a certain additive constant. Hence, the criterion 
functional U is equal to the complete integral of equation 
(3.1), up to an additive constant. 

On the basis of formulas (3.1) and (3.4), for any explicit representation 
of the functional on trajectories of system (1.1) and (1.7) we have 


eu ou 
a, Gp HH. (3.5) 


Formulas (3.5) can be used in programming numerical solution of 
variational problems, for example, by the method of residuals /6/. 

If a, = Afi and 6, = xji, the complete integral of the Hamilton-Jacobi 
equation has the form /7/ 


¢t a 
Wx, a, =| Ldt +>) xf! ng! (3.6) 


i fat 


To illustrate the application of the Hamilton-Jacobi method, let us obtain 
a condition for optimal coplanar impulsive transfer between elliptical orbits 
in the gravitational field of a spherically symmetrical central body. 
We consider the motion of a point in a system of polar coordinates (r, 9) 
with the origin at the central body. The equations of motion are 
v, = vir? — k377 + B cosy, 
Ue = U-Uer + BPsiny, | (3.7) 
r=, 9 = 067, 


where # is a constant, uv, and u,are the velocity components of the point 
along the radius-vector and the transverse direction, B > —u, Inm, u, 
is the constant effective exhaust velocity, mis the mass, » is the angle 
between the thrust direction and the radius-vector, measured in the 
direction from the radius-vector to the velocity of the point. 

Neglecting terms of the order of the ratio of the longest propulsion 
period to the total flight time, the equations of impulsive changes in velocity 
for these sections are 


vp =Bcos, ve=fPsiny, r=0, @=0. (3.7') 


Based on the possibilities of rocket engineering and permissible overload, 
a restriction must be imposed on the control: 


0<8<G, G=const. (3.8) 
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The equations of motion for the coasting periods can be obtained from 
system (3.7) by setting § = 0. 

Understanding optimum in the sense of least mass flow, the criterion 
functional is 


ff 
\ Bdt = u,In(m,m;'), (3.9) 


ry 


where m, is the initial mass and m, the final mass of the ship, f; and fy 
are the initial and final instants. 

The functional (3.9) must be minimized subject to the constraints (3.7) 
or (3.7'), the restriction (3.8) on the control 6, and the boundary conditions 


/8/ 


; = , ae 
v} = hep; 3 sin/,, vs = kp? re. ni = DP; (I + e,;cos/;)"}, 
9, =f, +i (3.10) 


1 1 
of = ke;p, ® sinf;, of = kp? rp*, rp = pp (le cosh), @p= fy ey. (3.11) 

Here "i" refers to the initial orbit, and "f' to the final orbit. Other 
notations used in formulas (3.10) and (3.11): e is the eccentricity, pis a 
parameter, ois the angular distance of the pericenter from the polar axis, 
f is the true anomaly. 

In the case under consideration R, = 0, and the boundary conditions are 
expressed by the explicit relations (3.10) and (3.11). We therefore do not 
introduce the undetermined constants (3.10), and take the transversality 
conditions in the genera! form 


(Av, + Aadoy+ dade + hdg—HAt)i! = 0. 


Here the total variations of the variables are calculated along the limit 
orbits. By virtue of formulas (3.10) and (3.11), the variations of all the 
variables except Af are expressed in terms of Af; and Af;. If the formula- 
tion of the problem does not involve a specific motion along the limit orbits, 
the variations Af;, Af,;, Afy, and Afy are independent. Hence we obtain 
H; = Hy =0, and because of the stationarity of equations (3.7) along the 
whole optimal trajectory H=0. The variations of the velocities and 
coordinates on the limit orbits are proportional to the right-hand sides of 
equations (3.7), calculated on the limit orbits for Bp =0. 

We obtain the following transversality conditions 


4 4 
D> MFi=0, Saf Ff =0, (3.12) 


fmt fan} 


where F} and Ff are the right-hand sides of equations (3.7) for the initial 
and final orbits respectively. 

On the basis of equations (3.7') we set up the function H for the propulsion 
periods 


H =B(—1+A,cosp+Assinp). (3.13) 
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The Hamilton-Jacobi equation is 
ow : ov ow. 
re +B (—1 + 5, 6S + 3o, Sin) = 0. 
The complete integral of this equation is 
t 
W = a,vu, + ave + | Bl — d, cos  — ag sin p) dé + asr + aq. 
fe 


The integrals (3.3) and (3.4) give 


f 


t 
v.— \Beosy= br, vy —{Bsing = by, r= bs (3.14) 


e te 
p= dn, a=A,, GQ, = As, G3 = As, GQ, = Ay. 


Let us turn our attention to the maximality condition for H on the optimal 
trajectory. Comparing the first and second partial derivatives of the 
function (3.13) with respect to Bf and », we reach the conclusion that the 
propulsion periods may correspond to trajectories of: a) programmed 
thrust if 0H/o6 = dH/dp = 0 or 


Ay =cost, As =siny; (3.15) 


b) maximum jet acceleration G if dH/08 >0, dH/ap=0. Since H = 0 and 
H is linear homogeneous function of B, the inequality sign for the derivative 
OH/oB must be omitted, and the undetermined multipliers are also deter- 
mined by formulas (3.15). 

Formulas (3.14) and (3.15) show that the angle of inclination » of the 
thrust remains constant for any propulsion period. For each such period 
we obtain, by virtue of (3.7) and (3.14), 


vp—o-; = V cos, Ve —¥, = Vsiny, r=, o=®, (3.16) 


B—Be=V, V=[(0,-—v%) + (vp —v8)}¥. 


Formulas (3.16) imply in particular that the mass flow under impulsive 
conditions is determined by the required velocity increment and is inde- 
pendent of the specific combustion law. 

Let us find the expressions for the undetermined multipliers for the 
coasting periods by the Hamilton-Jacobi method. 

The Hamilton-Jacobi equation has the form 


a + io (08r- — br) — F 0,090 +2, + x ver = 0. (3.17) 


The complete integral is 


VW =a,t+a.9+ ® (Ur, Uy; F, Ay, Ag, Gg, Ay), 


where @® is some function of the indicated arguments. 


165 


NOVOSELOV 
On the basis of formula (3.4) we thus have 


V 
y= oy =o 


Therefore, for both coasting and propulsion periods, A,= const. By virtue 
of the Weierstrass-Erdmann conditions, A, is the same constant for the 
entire optimal trajectory. 

For coast trajectories f = 0, when L =0, and on the basis of formula 
(3.6) the complete integral has the form 


4 
W (x, 2, t) = >) xP AY. (3.18) 


f=1 


Using formula (3.4) we obtain expressions for the undetermined 
multipliers: 


= g wer 
Pl Ox, * (3.19) 


As solutions of equations (3.7) in the coasting periods we take 
o, = kep-hsinf, ve = kp-*(1 +ecos/), r=p(l+ecos/)', p=f+o. (3.20) 


Here e,p and o are the eccentricity, a parameter and the angular distance 
of the pericenter of the elliptical transfer orbit from the polar axis, / 
is the true anomaly in this orbit. 

We note that the true anomaly is expressed in terms of time by means 
of the relations /8/ 


ts] m 


ig f= 728 x, E—esinE = ka-(t —¢,), (3.21) 


where ¢, is the time of passage through the pericenter, a is the major 
semiaxis, which is p (1 —e*)"' for an elliptical orbit. 

Denoting by s (k = }, 2, 3, 4) the Keplerian elements e,p,w and ¢,, formula 
(3.19) assumes the form 


mad A, Aad 2 Kconst (3.22) 
(=o *o,° n= 2s 1 95, = const. 2 

It is immediate that A, = —H. Introducing the notation A = —k-'e'p*A,, 
B= ke"p'A,,D = k'e-1p (2 —1) Ay + 2k! p’ Ay, C = — ke p*A, and calculating 


the derivatives 0s,/dx, with the aid of relations (3.20) and (3.21), we write 
expressions (3.22) for the case e#0 in the form 


A, = Acos/+ Besin/+C/, 
Ae = — Asin / (2+ ecosf/) + B(1 + ecosf/) + D(1 + ecosf)*>+CJ, 
Ne = — kp (1 + ecos/)*[Asin/ (1 + ecosf)'*— B—D(1 + ecos/)*+ CK), 
he = — hep“ A. (3.23) 
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where /, J and K are Lawden's functions /9/ 


IT=sinf Sy sin 7 (1 + ecosf)* df, 


J=e 'ctgf(l + ecosf) +e sin" /(1+ecos/)/, 
K =e" sin” f (1 + cos /)? (esin?/ — cos /) — e™' cosec f /. 


Formulas of this type for the undetermined multipliers were first 
obtained by Lawden. 

The continuity of the undetermined multipliers, their explicit representa- 
tions by formulas (3.15) and (3.23), the boundary conditions (3.10) and (3.11), 
the conditions (3.16) for the velocity increments for each propulsion period 
and the transversality conditions (3.12) provide sufficient equations to 
determine all the unknown quantities for a preassigned number of pulses. 


§ 4. Optimal impulsive transfer 


Investigations by a number of authors have shown that power optimal 
transfer trajectories between fixed limit orbits may have one, two or three 
impulsive propulsion periods. Single-impulse transfer is possible if the 
limit orbits intersect. Three-impulse transfer of the Hohmann type, 
consisting of twice-tangent ellipses, become optimal if the ratio of the major 
semi-axes exceeds 11.9. Four-impulse transfer and transfer with a greater 
number of impulses, based on Hohmann ellipses, cannot be optimal. If the 
ratio of the semi-axes is less than 11.9, a two-impulse transfer is an 
optimal Hohmann transfer. Most of the literature deals with two-impulse 
flights. 

The optimality of a two-impulse transfer between coplanar circular orbits 
along an ellipse tangent to both, was proved by Hohmann in 1925. 

In 1959 and 1960 Lawden, Smith, and Long published the results of 
intense investigations /9—11/, in which they constructed, without calculating 
specific flights along limit orbits, optimal two-impulse transfers between 
Spatial circular orbits, coplanar elliptical orbits with trimmed axes and 
a small angle between the lines of apsides, and an optimal two-impulse 
transfer between noncoplanar orbits with small eccentricities and finite 
mutual inclination. However, they were pessimistic as to the problem of 
optimal transfer in the planetary case, in which both the eccentricities of 
the limit orbits and the mutual inclination are small. The difficulty is that 
when the eccentricities of the limit orbits and their mutual inclination are 
made to tend to zero in turn, different limits are obtained. 

Using the technique described in /3/ for finding power optimal impulsive 
transfer, we have succeeded in solving this problem /12/. We start from 
a general spatial formulation of the problem in spherical coordinates. The 
result is as follows. If the order of smallness of the inclination is equal to 
or smaller than that of the eccentricities, the optimal transfer has an 
ascending node near the ascending node of the final orbit. If the order of 
smallness of the inclination is greater than that of the eccentricities, the 
ascending node of the transition orbit is close to the node of the transfer in 
the case of zero inclination. In both cases the optimal transfer is determined 
explicitly up to terms of second-order smallness. 
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This technique has yielded the solution of a number of problems taking 
into account the specific phases of the motion along the limit orbits, and 
the internal mass reserve, which is proportional to the total flight time. 

It was found that even a small internal mass reserve may lead to qualita- 
tively new noncoplanar transfer orbits. 

For example, for arbitrary phases of the motion along circular limit 
orbits (if the ratio of the sum of the characteristic velocities to the effective 
exhaust velocity is between In 5/3 and In 3), the transfer orbit is close to 
that of transfer without internal reserve; if the ratio is smaller than ln 5/3 
or larger than In 3, the node of the transfer orbit lags 90° behind the node 
of the orbit obtained without allowing for the internal mass reserve. There 
inclinations to the initial orbit in these two cases are one order greater 
than the mutual inclination of the limit orbits. The transfer orbits of the 
two indicated bases also differ considerably from each other. 

Consideration of specific phases of motion along the limit orbits in the 
case of spatial transfer also leads to similar, qualitatively new, phenomena. 

Using the theory of variations we can solve problems with nonimpulsive 
propulsion phases. We have solved the problem of nonimpulsive short 
coplanar transfer to a final circular orbit intersecting the initial orbit. 

The solution was assumed to be a power series in the angular velocity 
of the final circular orbit. It turned out that for a solution of first-order 
accuracy, both programmed schedules and schedules of maximum jet 
acceleration can be used. Retaining terms of second-order smallness the 
optimal trajectory is determined uniquely, except for the special case in 
which the residuals of the transversal and radial velocities are equal. 

If the residual in the transversal velocity exceeds that in the radial 
velocity, the optimal trajectory consists of a coasting phase, followed by 
a phase of maximum jet acceleration. The maneuver time is found from the 
optimum conditions and cannot be assigned in advance. 

If the residual in the transversal velocity is smaller than that in the 
radial velocity, the optimal trajectory consists of a propulsion phase with 
maximum jet acceleration, followed by a coasting phase, and a second 
propulsion phase with maximum jet acceleration. The maneuver time can 
be preassigned. 

The problem was solved in finite form, giving explicit expressions the 
times when the engine is switched on. We used Ting-Lu's approximating 
technique for the optimal trajectory /13/, which we generalized to the case 
of the variational problem with bounded controls. The solution is described 
by the author in /14/. 

Remark (added in proof). The construction of extremals used in § 2 is 
not always possible. To prove the necessity of the maximum principle it 
is possible to select a trajectory A&s differing from the optimal trajectory 
only in some internal time interval including the point (x, %, ¢) and tangent 
to the field of extremals corresponding to the optimum controls {us}, or 
intersecting these extremals, at least at two points. The sufficient 
conditions must be expanded considerably. The conditions indicated by the 
author are sufficient in the class of trajectories A8 for which the required 
field of extremals can be constructed. 

The inaccuracies in our argument regarding the sufficient conditions 
were pointed out in the discussion of the paper at the conference by A.I. 
Lur'e, V.V. Rumyantsev, V.A. Troitskii and V.F. Krotov. The inaccuracy 
of our original results regarding the sufficient conditions was indicated by 
L. Neustadt (Mathematical Reviews, Vol. 30, No. 2, 1889. 1965). 
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INVESTIGATION OF THE STABILITY OF MOTION OF 
SOLID BODIES WITH LIQUID- FILLED CAVITIES 


Problems concerning the motion of solid bodies with cavities partially 
or completely filled with liquid are largely stimulated by the diverse needs 
of engineering, but they also are of considerable interest from a theoretical 
point of view, being on the border between dynamics of solids and hydro- 
dynamics. In recent years a number of studies dealing with these problems 
have appeared. Nonlinear methods for investigating the stability of motion 
of such systems, connected mainly with the development of Lyapunov's ideas 
and methods, have been considerably developed. 

The purpose of the three parts of the present paper is to review some 
of these methods and to indicate their applicability. 

In formulating the problem of stability of motion of a solid body witha 
liquid, which is a system with infinitely many degrees of freedom, one is 
first faced with the problem of defining the concept of stability: stability 
in what sense and with respect to what quantities? In his famous thesis /1/, 
when defining stability for systems with finitely many degrees of freedom, 
Lyapunov emphasized the necessity of indicating which quantities are 
assumed to remain small under conditions of stability, since the concept 
is not absolute. 

At present there are three definitions of the stability of motion of liquid- 
filled solid bodies in the nonlinear formulation. 

In the simplest case the motion of the liquid in the cavity is characterized 
by a finite number of variables. It is known that this is possible when the 
cavity is completely filled with an ideal liquid, when the motion of the liquid 
is potential or a uniform rotational motion. In this case it is natural to 
formulate the stability problem as that of Lyapunov stability for systems 
with finitely many degrees of freedom /1, 2/. 

In general, when no assumptions are made on the motion of the liquid in 
the cavity other than the natural assumptions of continuity and compactness, 
the state of the system is described by an infinite number of variables, and 
the stability problem is much more complicated. However, even in this 
case it is possible to formulate a problem of stability with respect toa 
finite number of variables, by considering certain quantities characterizing 
the motion of the liquid in an integral manner. 

Finally, a third definition of stability is connected with Lyapunov's ideas 
/3, 4/ in the theory of equilibrium figures of a rotating liquid; it is related 
to the stability of the form of the liquid. In this case the stability problem 
is reduced to minimizing a certain expression, in general a functional. 

We dealt with the first two approaches to the stability problem in our 
review /9/, and therefore in the present paper we consider them as briefly 
as possible. 


170 


RUMYANTSEV 


I. COMPLETELY FILLED CAVITY; SIMPLEST CASES 


1. Irrotational motion 


Consider a solid body with a simply- or multiply-connected cavity of 
arbitrary form, completely filled with an ideal homogeneous incompressible 
liquid subject to potential mass forces. Suppose the motion of the liquid 
is irrotational, the velocity vector v= grad@®. Due to the incompressibility 
of the liquid, the velocity potential ® is a harmonic function of the 
coordinates, whose derivative in the direction of the normal at the walls 
of the cavity is equal to the normal component of the velocity of the 
corresponding point of the wall, and the circulations along the principal 
contours inside a multiply-connected cavity are given constants &, determined 
by the motion of the liquid in the solid body at rest. 

This case was studied thoroughly by N. E. Zhukovskii /5/, whoestablished 
a number of fundamental theorems. Following Zhukovskii, the function 
is determined by decomposing the motion of the solid body into a translation 
with the velocity of some point ® and a rotation about this point. The 
translational motion has a unique potential function, so that it is commu- 
nicated directly to the liquid and does not influence its internal motion, 
which varies only because of the rotation of the body about the point O. The 
velocity potential, expressed in a moving system of coordinate axes Oxyz 
rigidly attached to the body, is assumed to have the form 


oo 9 


D (x, y, Z, t) = OX + Way + Wz + >} wp; (x, y, 2) + 2 RB; (x, Y, 2), (1.1) 


where w; and ow are the projections of the velocity vector of the point O and 
of the instantaneous angular velocity vector of the body, y, (x, y, z)and 9, (x, y, 2) 
are harmonic functions which satisfy at the walls o of the cavity the 
conditions: 
ay. ay. _ dy 
Th yn — zm, Gn = zh — x0, a mT am—yl, 
d® 


St =0 (J=1,.-..7— 1), (1.2) 


where the function 6, (x, y, z) decreases by unity when passing through the j-th 
partition in the direction of the /-th principal contour and varies continuously 
in passing through the remaining partitions, which can be constructed in 
the cavity in order to make it simply-connected /6/; the components of the 
unit normal vector p are denoted by I.m,n 

The motion of the liquid in the cavity is thus determined by a finite 
number of quantities w, w, ({ =1, 2,3) and & (j= 1,...,7—1), and the motion 
of the whole system (body with liquid) is described by the equations of the 
dynamics of solid bodies together with the Laplace equation. 

The mechanical effect of a liquid which has no initial velocities in the 
body at rest is identical to the effect of some equivalent body attached to 
the given solid body. If the liquid in a multiply-connected cavity has some 
initial irrotational motion, it produces an additional effect similar to that 
of some revolving rotor attached to the solid body. This important theorem 
is due to Zhukovskii /5/. 
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Thus, in the case of irrotational motion of an ideal liquid completely 
filling the cavity of a body, the motion of the body is determined by a finite 
number of variables. In this case, therefore, it is natural to formulate 
the stability problem in the sense of Lyapunov stability for systems with 
finitely many degrees of freedom. N.G. Chetaev formulated and solved 
this problem for a number of cases /7/. 

Example 1 /10/. Let a solid body with liquid move about a stationary 
point O under the action of forces with a force function U (y,)/8/, y; (i = 1, 2, 3) 
denoting the direction cosines of the stationary axis O¢ relative to the moving 
axeS x,y, 2. We assume that the initial motion of the liquid in the multiply- 
connected cavity is such that the only nonzero component of the angular 
momentum of the liquid is that along the z axis, R. Under these conditions 
the equations of motion have a particular solution which describes uniform 
rotation of the solid body about the ¢ axis with angular velocity o, provided 
that (QU/dy5)+,—3 + 0°. 

Let us study the stability of this motion with respect to ow, y, (i = 1, 2, 3). 
First consider the case of a dynamically symmetric body, when A = B&. 

Using the known first integrals of the equations of perturbed motion /1/, we 
follow Chetaev's method /2/ and construct the function 


V = A(oi + 03) + 2AA (ait + Oats) + 2CAEN — 


—[Cor—(F) + Ral Cit A+ M+ Get... (1.3) 


where 4 is a constant; the time-derivative of the function V vanishes by 
virtue of the equations of perturbed motion. Since by the condition V’ = 0. 


(Co+R)+4A(H) >0, (1.4) 


the function V is positive definite, Lyapunov's theorem implies that 
inequality (1.4) is a sufficient condition for stability of the unperturbed 
motion. 

Considering the function VW = o,), —o,y,, we easily see, on the basis of 
Chetaev's instability theorem, that (1.4) is also a necessary condition for 
stability. 

In the case A >8, a sufficient condition for stability can be obtained 
in a Similar way: 


(C— A) o* + Rot (5) >0. (1.5) 


2. Homogeneous rotational motion of the liquid /11/ 


Suppose a homogeneous incompressible ideal liquid completely fills an 
ellipsoidal cavity: 


— — a \ 
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The velocity of the liquid can be represented in the form 
v = grado + Qxr, (1.7) 
where the harmonic function 


(x, Y, 2, fF) = Wx + Way + Wz + 


m—2 


+ ((6% —c*) yz + 2 (c2yoz — b*zay)) e+ + 
+ [(c? — a*) xz + 2 (a*z9u — c*x9z)) 3 + 
+ ((a? — 6%) xy + 2 (b%xoy — a*yox)) H (1.8) 


satisfies the boundary conditions at the walls of the cavity, and the vector 
Q (4 satisfies the Helmholtz equation 


2 + ox = (2-¥)v. (1.9) 


Thus, in this case also the motion of the solid body with the liquid is 
completely characterized by a finite number of variables, which makes it 
possible to formulate the problem of Lyapunov stability for systems with 
finitely many degrees of freedom. 

For examples, see /9, 10, 11, 12/. 


II. STABILITY OF MOTION OF A LIQUID- FILLED 
SOLID BODY WITH RESPECT TO SOME OF THE 
VARIA BLES 


1. On the equations of motion 


Imagine a free or constrained solid body with a cavity partially or 
completely filled with an ideal or viscous liquid. Considering the body 
and the liquid as a single mechanical system subject to given forces, it 
is easy to set up its equations of motion in some form; in general we get a 
system of ordinary and partial differential equations with appropriate initial 
and boundary conditions /13/. 

For example, if the solid body has a fixed point, the equations of motion 
can be written 


1 + 9) + x (G1. + Hs) =L, 
7 + oxv= F — = gradp+ vay, (2.1) 
divv = 0, 
where g, is the momentum of the body (i = 1) and of the liquid (é = 2), L is the 
moment of the external forces applied to the system, F is the vector of 


mass forces acting on the liquid, pis the density, p (x,y,z, t) is the pressure, 
4 is the Laplace operator, p is the coefficient of viscosity, v = p/p. 
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The velocity vector of the liquid is v=v,+u, where u is the velocity 
vector of the liquid relative to the body, v, is the translational velocity 
vector. 

The boundary conditions are: at the cavity walls o, with which the 
viscous liquid is in contact: 


u=0Q, (2.2) 


at the free surface o;, whose equation we suppose to be f (x,y,z, /) = 0, we 
have the kinematic condition 


0//ot + u-grad f =0 (2.3) 
and the dynamic condition 


Po = — Pol. (2.4) 


Here p, denotes the stress vector, n is the unit vector on the external 
normal, p, is the constant air pressure over the free surface. In the case 
of an ideal liquid (v = 0), conditions (2.2) and (2.4) are replaced by 


u,=O0 ato,, p=Ppo at a, (2.5) 


The equations of a solid body with liquid admit, under certain conditions, 
an integral relation and first integrals which may prove useful in stability 
investigations. 

Thus, if the forces acting on the system have a force function U, and the 
constraints imposed on the body are ideal and stationary, the following 
equation is easily obtained: 


for =— eS [RY + C+ C2) ]+ 


do. Oo; \3 do Ov \3 dv. Ov, \3 
+(G+ BN +4 SY + B+ By 2.8) 
where + is the volume of the liquid. 
Hence it follows that motion of a viscous liquid without dissipation of 

energy is possible only if in the whole liquid 

dr, _ Oo Om, O09 , Oe, Ov dvs __ ao. Qo. 

ya Btatatant at yH% (2.7) 
i.e., if the liquid and the body move as a single rigid body. For an ideal 
liquid, equation (2.6) yields the energy integral 


T—U=h. (2.8) 


If the moment of the external forces acting on the system is zéro relative 
to some stationary axis, say the {-axis, then the component of the angular 
momentum g, of the system along this axis remains constant during the 
entire motion. 

If the ellipsoid of inertia of the body and the cavity isa solid of revolution, 
say about the z-axis, and the moment of the external forces with respect to 
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this axis is zero, then in the case of an ideal liquid the component of the 
instantaneous angular velocity of the body along this axis, o,, remains 
constant /13/. 


2. Stability with respect to some of the variables 


The position of the solid body and the liquid relative to a stationary 
system of coordinates axes 0,§n{ can be defined by the Lagrangian coordinates 
of the body q,(j = 1,...n<« 6) and absolute &,7n, ¢ or relative x,y, z coordinates 
of the liquid particles, which are infinite in number. 

Suppose the equations of motion have some particular solution 


Vahl, H=ht) (j=l... a) 
vo, = F,(x,y,z,t) (i=1, 2, 3), (2.9) 
p= F,(x, y, z, %), 


which satisfies certain initial conditions 


die = Fy (to)s Mme = fy (to), 
01 (X,Y, 2, te) = @ (x, ¥, 2) (2.10) 


(where ?, is the initial instant), and also boundary conditions. 

The solution (2.9) describes some motion of the solid body with the liquid; 
let us take this as the unperturbed motion and investigate its stability. 
Suppose that for the perturbed motion we have, instead of (2.10), initial 
conditions 


Gp =S(to) +e 9 = 1) (te) + 8p 


(2.11) 
Us = O (X,Y, Z) + 8 (x, Y, 2), 
where e, and e, are real constants, while se, (x, y, 2) are real functions 
satisfying the equation of continuity, which we shall call perturbations. 
We assume the perturbations to be sufficiently small in absolute value. 
We obtain the equations of perturbed motion from the original equations 
of motion of the system by substituting 


V=hOtsp YV=hO+sp 
oO, = F,(x,y,2,t) +(x, 9,2, ¢), p= Fe(x,y,2,t)+ p(x, y, 2, 0). 


(2.12) 


If it turns out that for any perturbations ¢,, e,, 6 (x, y, 2) of sufficiently 
small absolute value the deviations or variations of x,, x; (= 1,...,A), 
u; (i= 1, 2, 3), p* are arbitrarily small in absolute value for any ¢>4%, then 
the unperturbed motion (2.9) is stable with respect to q, 9), wand p. 

However, in this formulation the problem presents tremendous 
difficulties, and at present the only known methods for its solution are 
those of the theory of small oscillations. 

These difficulties may be avoided by varying the formulation of the 
problem. The point is that in applied problems of stability of motion of 
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solid bodies with liquid-filled cavities we are interested mainly in the 
stability of motion of the solid body; the stability of motion of the liquid in 
the cavity is of interest only inasmuch as it affects the stability of motion 
of the solid body. Of course, these two aspects of the single stability 
problem are interrelated: the motion of the body depends on that of the 
liquid, and vice versa. 

It is therefore natural to formulate the problem in the Lyapunov sense 
with respect to the variables which determine the motion of the solid body, 
and to some quantities 


a= \o, (x, Y, Z, Uy, 0g, Us) at, 


t 


which characterize the motion of the liquid in an integral manner. Here 

®, (x, y, Z, 0;, Ug, Ug) are certain real, continuous, bounded functions. Thus, 
for example, it follows from equations (2.1) that for given external forces 
acting on the system, the motion of the solid body depends on the angular 
momentum of the liquid and its rate of variation with time, which in turn 
depend on the motion of the body. Inthis problem, therefore, an appropriate 
choice for the p, is the components of the angular momentum of the liquid. 

We note that the p,—the momentum, angular momentum of the liquid, 
etc.—do not characterize the motion of the liquid (which depends on an 
infinite number of variables) completely, but only in an integral manner. 
This being so, the stability of motion of the liquid with respect to the p,, 
is conditional stability—the stability of motion of a continuous medium with 
respect to some of the infinitely many variables characterizing its motion. 

In this approach, our problem, which refers to a system with infinitely 
many degrees of freedom, is reduced to the investigation of stability with 
respect to a finite number of quantities q, 4),p,, and this ensures stability 
of motion of the solid body with respect to q,, q'. 

Using this formulation of the problem /14/, the methods of the theory of 
stability of systems with finitely many degrees of freedom, in particular, 
Lyapunov's second method, may be applied to its solution. 

Consider some given continuous real functions Q, of the quantities 
9p 9 p,and the time ¢. For the unperturbed motion, the functions Q,, after 
substituting (2.9), are transformed into certain known functions of time F, (0, 
and for the perturbed motion (2.12) (maintaining the previous notation Q,) 
they are functions of the time and the variables Xp x, and u,, which depend 
in turn on the perturbations e,, ei, e,(x,y,z). Consider the differences y,=Q,—F,, 
which depend on the time ¢ and the variables x, x, and u,. Let L, be 
arbitrary positive numbers. If for any L,, however small, there exist 
positive numbers ££’, E, such that for any perturbations e, e, e (x,y, z) 
satisfying the conditions le |< &, |el<e,, le, (x, ¥,2)| <Q £, and for any t>%, 
we have |Q, —F,| <L,, then the unperturbed motion is stable with respect 
to the quantities Q,, and unstable otherwise /1/. 

We present a theorem /15/ which is useful in some cases as a modifica- 
tion of Lyapunov's stability theorem. 

Suppose there exists a real, continuous, bounded, single-valued function 
@ (t, Xp Xj» Uy) which vanishes for x, = x,=u,=0, whose time-derivative ¢@ < 0 


] 
by virtue of the equations of perturbed motion. Suppose that for all relevant 
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values of the variables ¢, x, x, u, and the corresponding values of the 
variables y, we have the inequality 


W (ty Yar o-+ 1 Ya) S(t Xp Xp tt), (2.13) 


where Y¥ (ft, y-) is a real, continuous, bounded, single-valued function which 
vanishes when all y, =0O(r=I1,..., &). 

Theorem 2.1. If the differential equations of perturbed motion are 
such that there exists a function 9 (t, x, x\, u,) whose time derivative 9 < 0, 
anda positive definite function ¥ (t, y,,..., ys) such that inequality (2.13) holds, 
then the unperturbed motion ts stable with respect to Q,(r=1,..., &). 

Example 2. Consider a solid body with a cavity completely filled with 
a viscous liquid, moving about a fixed point O due to forces with a force 
function U (y,)/16/. Included among the real motions are uniform rotations 
of the entire system as a single rigid body about a stationary axis O0¢. The 
problem of stability of such motions with respect to the quantities @,, Gy, 
vy: (§ = 1, 2,3) can be solved by considering the function 


V = A,o) + Bos + C,E* — 20 [(A,01 + Gar) ¥11 + 
+ (Bios + Gn) ta + (CE +I + G G+ GF Gat 


dvs (cot (Y)eterrmea-Gae aw 


where the constant 4 > (@U/dy3), — (0U/d,);. For the function V to be positive 
definite, it is necessary and sufficient that the following inequality be 
satisfied: 


(C—A)o* + (8U/a%),>0 (A> B). (2.15) 


It is easily seen /10/ that the conditions of Theorem (2.1) are satisfied, so 
that inequality (2.15) is a sufficient condition for the stability of the 
unperturbed motion. 

In a similar formulation, N.N. Kolesnikov /17/ solved the problem of 
the stability of motion of a free solid body with a cavity filled with a viscous 
liquid in a Newtonian central force field. He proved that the conditions 


C>A>8B (2.16) 
are sufficient for the motion of the body along a circular orbit to be stable. 


Conditions (2.16) have the same form as those established by V. V. Beletskii 
/18/ for the case of a solid body alone. 


Ill. STABILITY OF STEADY-STATE MOTIONS OF 
A LIQUID-FILLED SOLID BODY 


1. Some formulas 


Consider a solid body having a simply-connected cavity partially or 
completely filled with a homogeneous incompressible ideal liquid. Suppose 
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that stationary ideal constraints are imposed on the body, given forces 
applied to the body, and the mass forces acting on the liquid particles have 
force functions U, and U, which do not depend explicitly on the time. As 
before, we shall assume that the motions of both the body and the liquid are 
continuous, so that the coordinates of each particle of the liquid are. 
continuous functions of their initial values and time. 

Under these conditions the differential equations of motion of the system 
have an energy integral 


where V is the potential energy of the forces acting on the system, and A 
is an integration constant. 

The position of the solid body relative to a system of coordinates 0,§nf 
is defined by its Lagrangian coordinates q,,...,9,(an <6). The potential 
energy V of the system is in general a function of the coordinates 
Gu +++» 9 (fF <a) and the form of the liquid inside the cavity. 

We also assume that the constraints imposed on the body permit rotation 
of the whole system as a single rigid body about some fixed axis, about 
which the forces acting on the system have no moment. Then the potential 
energy V of the system obviously does not depend on the angle of rotation g, 
of the body about the fixed axis. Under these conditions there exists an 
area integral for the plane orthogonal to this axis. Taking this axis as the 
axis O,¢ of the stationary coordinate system, the area integral has the form 


Ge =k. (3.2) 


Together with the stationary coordinate system we introduce a system 
of coordinate axes 0,§,n,f rotating about the €-axis with some angular 
velocity w. We denote by v (0, 9, u,) the absolute velocity vector of any 
given point of the body or liquid with radius-vector r (f,n, ¢), and by u (u, o, a) 
the velocity vector of the same point relative to the system 0,§,,C; then 
v=u+oxr. 

The kinetic energy and the €-component of the angular momentum of the 
system can then be represented in the form 


THT, +0KR+ FS, H=R+oS, (3.3) 
where 


T,= + Dm. (us + 03 + w?), GF = J ms (Ewe — Notte) 


The angular velocity o of the rotating system 0,§,n,¢ may be assigned 
arbitrarily; we agree to choose it /4/ so that at any instant the ¢-component 
of the relative angular momentum of the system is zero #& =0, which, by 
virtue of (3.2) and (3.3), is equivalent to the equation 


oS = k. (3.4) 
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With this choice of w, the energy integral (3.1) can be represented in 
the form 


T.+5—t+VHh. (3.5) 


Under the above assumptions regarding the constraints and the forces, 
the system may indeed perform uniform rotations as a single rigid body 
about the stationary axis ¢. In this case the system is in equilibrium with 
respect to the system 0,{,n,t, which rotates about the ¢ axis at uniform 
angular velocity a). 

The general equation of dynamics then leads to the equality 


su =0, U=+-alS—V, (3.6) 


where the symbol 6 denotes variation corresponding to virtual displacements 
of the system which are compatible with the constraints. But the expression 
for U may be regarded as the force function of the given forces and the 
centrifugal inertial forces. By the principle of virtual displacements, 
equality (3.6) represents the condition of equilibrium of the system with 
respect to the coordinate axes 0,§,,0, if the latter rotate at constant angular 
velocity w,. 

Let us introduce the expression 


eum 
2 


ula 


Vv = +YV, (3.7) 
where & is the value of the area constant & for uniform rotation of the whole 
system as a single rigid body about the €-axis at angular velocity w,. The 
variation of this expression corresponding to virtual displacements of the 


a 
system is 6WV ee “0 86S + 6V, where S, is the value of S for steady-state 


2 
motion. Se 
Comparing 6W with 6U and using the fact that o,S, = &, we see that (3.6) 
is equivalent to the equation 


sw =0. (3.8) 


Thus, in the case of steady-state motion of the system, expression (3.7) 
has an extremal (stationary) value. 

According to its definition (3.7), W depends on the same coordinates of 
the body q;,...,4a-. aS Sand V, on the form of the liquid and on the value 
of the constant &. Ne 

Condition (3.8) leads, as is easily seen, to the equations 


4 1 39S ov 


for the coordinates q of the solid body in steady-state motion, and also 
to equations for the pressure in the liquid, which yield the equation 


08 (88 + n*) + Ua(8, n, 0) == const (3.10) 
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of its free surface in this motion, if the liquid does not fill the cavity 
completely. 

For a fixed value of the parameter &,, equations (3.9) and (3.10) determine 
the coordinates of the solid body and the form of the free surface of the 
liquid in steady-state motion. 


2. Stability theorem 


Consider some steady-state motion of the system corresponding to a 
given value of the area constant &. Without loss of generality, let us 
assume that the roots of equations (3.9) for the given value of & are 
qa=O0O(%=1,...,a —1). In this case the liquid has a form of relative 
equilibrium f,, bounded by the free surface o, defined by equation (3.10) 
and the cavity walls with which the liquid is in contact. 

Let us investigate the stability of this steady-state motion using the 
energy integral (3.5). 

If the liquid completely fills the cavity, stability of motion will mean 
stability in the Lyapunov sense /1/, with respect to the noncyclic coordinates 


Qi, ---» 9a. Of the body on which the potential energy V and the moment of 
inertia S of the system depend explicitly, the generalized velocities 
g,.---»q, and the kinetic energy T” of the liquid. 


If the cavity is only partially full and the liquid has a free surface, the 
case is more complicated. As shown by Lyapunov, /3/, the energy 
integral does not ensure, in general, that the liquid, after its state of rest 
(absolute or relative) has been disturbed, have the type of motion which 
serves as a criterion for stable equilibrium in the mechanics of systems 
with finitely many degrees of freedom. Lyapunov established that the 
difficulty disappears only if the stable equilibrium form is defined in such 
a way that, after imparting to the liquid sufficiently small perturbations, 
its form remains only slightly different from this equilibrium form, unless 
arbitrarily thin string or leaf projections form on the surface of the liquid. 

We adopt this definition and, following Lyapunov, introduce some of the 
relevant concepts as applied to our problem. We shall compare the relative 
equilibrium form of the liquid /, with its form f at any instant of the 
perturbed motion, disregarding the motion of the liquid particles themselves 
but taking the kinetic energy of the liquid into account. The form f is 
bounded by the free surface o,; of the liquid and the cavity walls with which 
the liquid is in contact at the given instant. For perturbed motion sufficiently 
close to the unperturbed motion, in a system of coordinates xyz rigidly 
attached to the body, the forms f, and f differ only in the free surfaces o, 
and o,. Owing to the incompressibility of the liquid, the volume of the form 
f is of course equal to that of the form f/f. 

Consider some point P of the surface o,7 and the point P, nearest to it on 
the surface o,. As the position of the point P varies on the surface go, the 
distance PP, also varies, and for some position of the point P becomes the 
greatest possible distance for the given instant. Lyapunov called this 
maximum distance PP, the separation; let us denote it by /. We also 
introduce the deviation V of the liquid form / from the equilibrium form f,, 
defining it as the volume of the part of the form / outside the form f,, or, 
equivalently, the volume of the part of f, outside f. 
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It is obvious that for a given separation /, the deviation V has a certain 
maximum of the form (ij), where »(f is a positive function none of whose 
values exceed a certain limit. For / not exceeding some given number A, 
the function »(f has a nonzero minimum. The minimum deviation V for 
a given value of / is always zero /4/. 

For continuous motion of the body and liquid the separation / and the 
deviation V are obviously continuous functions of time. 

If for any arbitrarily small positive numbers L,andL,, there exists a 
positive number 4 such that for any initial values of the coordinates q, 
and the generalized velocities 9), (i= 1,...,a —l)of the body, the separation 
lb, the deviation Vo and the relative velocities of the liquid us, u, ma, 
satisfying conditions 


Gla, 19,14, [lol A, Vo > ela, 


(3.11) 
[uol< a, [OlQa, Jmol<a, 
and for any ¢>4, or, at least, for 
V>el (3.12) 
we have inequalities 
Igel<Lay JE<Li Nall ITP I< Ls (3.13) 


then the unperturbed steady-state motion of the solid body with liquid is 
called stable; otherwise unstable. 

Here e denotes a positive number smaller than the minimum of the function 
» (9 under the condition |/|<L,, and the quantity e/ can be regarded as the 
possible deviation of the liquid. Note that if the liquid completely fills the 
cavity, the condition connected with (3.12) is omitted. 

We also need the concept of the minimum for W. If Wis a function 
W (9: .. +, Ga-1), its minimum for a fixed value of the constant & is an isolated 
minimum with respect to those variables q,,...... »Yn-. ON which it explicitly 
depends. If the liquid does not fill the cavity completely, then, following 
Lyapunov /3/, we use the following definition of the minimum for VW. 


If for the steady-state motion in question (q,=0 (i= 1,...,.a —1}), 
{=0, V=0) W,is the minimum of W, then there exists a sufficiently small 
positive number €& such that for all values of the coordinates gq, (i= 1,....a2 —1), 


the separation / and the deviation V, such that |q,| < E,|/|< £. y >el, where 

e is a positive number smaller than the minimum of the function »() under 

the condition |/| <q —, the difference W—W, is always positive, vanishing only 
for gq=—0,/=0,7=90. 

We note that for any given value of /, the difference YW — WY, may be made 
arbitrarily small by appropriate selection of a position of the body and a 
form of the liquid for which |q,| and V are sufficiently small. But the limit 
case, in which for /#0 all g¢=O0O((=1,...,a —1), V, and consequently also 
W — W, vanish is obviously impossible, if only forms which the liquid can 
assume are considered. The condition y > e/ was introduced to eliminate 
this difficulty. 
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3 
Theorem 3.1. If V= = +V has an isolated minimum W, for the 
steady-state motion of a solid body with a liquid-filled cavity, the 
unperturbed motion ts stable. 
The simple proof of this theorem /19/ is based on the use of the energy 
integral (3.5), which we write in the form 


pe _ 
T+ We aT WO 4 (3.14) 


where (0) denotes the initial value of the respective quantity; &is the value 
of the area constant for the perturbed motion. 

Remark 1. Lyapunov /4/ pointed out that in order to characterize 
the difference between the perturbed and the unperturbed form of the liquid 
it is possible to replace the separation i by some other quantities which 
may vanish only for the unperturbed form. One may, for example, take the 
deviation V, and prove as before that if, for the considered steady-state 
motion of a solid body with a liquid partially filling the cavity, W has a 
minimum VW, in the sense that VW — VW, > 0 for all values of |q,j and V not 
simultaneously zero and smaller than a fixed constant, then (for sufficiently 
small perturbations) 9g, q (§=1,...,a—1),V and 7 remain smaller than 
any preassigned, arbitrarily small, constants during the entire motion. 

Corollary. If the potential energy V of the system has an isolated 
minimum V, for the equilibrium position of the solid body with a liquid-filled 
cavity, (for & = 0) the equilibrium position is stable /20/. We note that 
this result is also applicable to the case of relative equilibrium of a solid 
body with a liquid-filled cavity. 

Suppose that for some fixed value of the parameter W has a minimum, 
i.e., the steady-state motion is stable. Let us now vary the parameter &, 
continuously, so that the roots of equations (3.9) describe some branch C 
of the 'equilibrium'' curve. Theexpressionfor V will then vary continuously; 
for all points of the curve C for which W remains a minimum, the steady- 
state motions are stable. A change of stability on this branch may occur 
only at bifurcation points. 

The above theorems remain valid for a viscous liquid. For example, 
in the proof of Theorem 3.1 we need only replace equation (3.14) by the 
inequality 


T.4V4+5 


ho — ke 1 ¢—# 
5 0 STP + V%4+5— (3.15) 


changing nothing in the proof. Moreover, under the conditions of Theorem 
3.1 it can be proved that any perturbed motion sufficiently close to the 
unperturbed motion tends in the limit to the steady-state motion of the 
system as a single rigid body. If, however, for an isolated steady-state 
motion of a solid body with a cavity filled with a viscous liquid VW has no 
minimum, this motion is unstable. 

Hitherto we have not taken the surface tension of the liquid into consider- 
ation. In a number of cases, however, particularly in conditions of 
weightlessness of "low gravity’, this factor may be important. 

The theorems proved in this section remain valid also when the surface 
tension of the liquid is taken into account, if the potential energy of the 
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surface tension forces ao+ a,o, +a,0, is addedtothe criterion functionof the 
stability problem. Here a,a,,a, denote the coefficients of surface tension, 
o,0,,06, are the areas of the free surface of the liquid and the surfaces of 
contact of the liquid and air with the walls of the cavity. Then, for example, 
a theorem analogous to 3.1 may be formulated as follows. 

If the expression 


2? 
W = 5-2 +V +054 a0; + 0405 (3.16) 


has an isolated minimum W, for steady-state motion of a solid body with 
a liquid-filled cavity, the unperturbed motion is stable. 
The proof is completely analogous to that of Theorem 3.1. 


3. The minimum problem 


We now proceed to the solution of the minimum problem. Since the 
minimum property is local in the sense that values of the W at a given point 
are compared with its values at all points of a sufficiently small neighbor- 
hood, a rigorous solution of the minimum problem, excluding some special 
cases, may be confined to consideration of second-order quantities. The 
methods of the theory of small oscillations are thus very convenient, if the 
displacement / of the free surface from its equilibrium position is 
represented as a series in a system of eigenfunctions of the corresponding 
boundary-value problem. This method was used, for example, by N.N. 
Moiseev /21/ and G.S. Narimanov /22/ to solve the problem of the 
minimum potential energy of a heavy pendulum containing a cavity filled 
with a heavy liquid, and by V.V. Rumyantsev /19/ to solve the problem 
of the minimum of W for a heavy solid body containing a cavity filled with 
a weightless liquid. 

However, the calculations are very tedious since one must operate with 
infinite series, multiply them, integrate them over certain regions, etc. 

In this respect, another method, the main idea of which is due to G. K. 
Pozharitskii /23/, is more convenient. He pointed out that the potential 
energy of a heavy liquid has a minimum when the container deviates from 
the equilibrium position, if the free surface of the liquid is a horizontal 
plane. 

We describe the solution of the minimum problem for W/24/. 

In the neighborhood of the steady-state motion of a solid body witha 


liquid, for which gq =0(i=1,...,a— 1), consider a region of the variables 
la|<H, (3.17) 
where H >Ois a sufficiently small constant. Let g(i=1...... ,a —1)be 


the coordinates of some fixed points of the region (3.17). Let us find the 
form of the free surface of the liquid such that W has an extremum for the 
given fixed g,. To solve this problem, we find the first variation of W for 
fixed g, and equate it to zero: 


I sn 
ow = — el [> OG +m) + UG, n, ¢)]de = 0, (3.18) 
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where S is the moment of inertia of the system with respect to the €-axis 
for the given q, and the required form of the free surface of the liquid. 
Varying the integrand and using the equation of continuity and the boundary 
conditions for the liquid, we find that (3.18) is possible only if the free 
liquid surface has the form 


mc (&* + n*) + Us(E, n, 6) = const. (3.19) 


For steady-state motion of the system, when q,=0 (i= 1, 
equation (3.19) coincides with equation (3.10). Under fairly general 
assumptions on the smoothness of the cavity walls, the liquid level and 
the character of the mass forces acting on the liquid, it can be shown /24/ 
that if the free surface of the liquid is defined by (3.19) for given fixed y,, 
then W has a minimum. 

For any given values of qg, in the region (3.17), a solid body with a liquid- 
filled cavity can be associated with a certain solid body, which we call 
transformed, consisting of the given solid body and the frozen liquid with 
free surface (3.19). Then according to the above, W is a minimum for the 
transformed body in comparison with all possible free surfaces of the liquid 
sufficiently close to (3.19). 

Theorem 3.2. In order that W have a minimum in steady-state motion 
of a solid body with a liquid-filled cavity, it is necessary and sufficient that 
W have a minimum for q,=0 for the transformed solid bodies in the region 
(3.17). 

Thus, the minimum problem for W is reduced to the minimum problem 
for a function of a finite number of variables —to the case of W for a solid 
body with a liquid bounded by the cavity walls o, and the free surface (3.19). 

Let us find the variation of W for the transformed solid body in passing 
from a position corresponding to steady-state motion of the system for 
g: = 0 to a perturbed position in the region (3.17). We can conceive of this 
transition as proceeding in two stages: 1) displacement of the whole system, 
as a single rigid body, to a perturbed position; 2) deformation of the form 
f, of the liquid (by covering its free surface with a layer +t, of zero volume) 
into a form / with free surface (3.19). 

The increment of W may then be represented in the form 


AW =4,V7 + A,W, (3.20) 
where A, denotes the increment when the whole system is displaced to the 
perturbed position as a single rigid body, and A, is the increment due to 
the subsequent deformation of the free surface of the liquid into the surface 


(3.19). Similarly, AS = A,S + 4,S. 
Up to second-order accuracy in q, we have 


8 
Aa = —p | [5 okie + nt) + Us]de + se U(4eS)* + 20S: O95] + 4 


where the zero index refers to the unperturbed position of the system. 
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To calculate A,W, it is convenient to introduce a moving coordinate 
system xyz, rigidly attached to the solid body, whose z-axis in the unper- 
turbed position coincides, say, with the (-axis. 

The integrand in the expression for A,W, in terms of the variables x, y, z, 
will be denoted by ® (x, y, z, 9). The equation of the free surface of the 
solidified liquid (3.10) in terms of the variables x,y and z is 


@ (x, y, Z, 0) = 0} (x* + y") + Unie, ¥, 2) =e. (3.21) 


The value of the constant ¢c is determined by the amount of liquid in the 
cavity. We assume that equation (3.21) has a unique solution with respect 
to one of the variables x, y, and z (for example, with respect to z); for this 
it is sufficient that at no point of the surface (3.21) d@/dz %0. Let us denote 
by Qthe region of the xy plane bounded by the projection of a closed curve 
representing the locus of the intersection points of the surface (3.21) with 
the cavity walls o,. 

Expressed in terms of the moving axes, the surface (3.19) assumes the 
form 


a, (x, y, 2, qu) =, (3.22) 


where the constant c,=c-+ Ac is determined from the equality of the liquid 
volumes in the cavity, having free surfaces (3.21) and (3.22). The latter 
condition is equivalent to the fact that the volume of the deforming layer 

t, is zero: 


(dr =0. 


In the first approximation this equation is equivalent to the equation 


hae ay {de = 0, 
ze 


where z and z, denote, respectively, the values of the variable z for points 
of the surface (3.21) and (3.22). Replacing the variable z by a new variable 
p= OD (x,y,z, 9) —c, the above equation can be represented with the same 
accuracy in the form 


(\(35), (ti — Ho) dx dy = 0, (3.23) 
Q 


where, to within first-order accuracy, 


Aa—i 


Ho = D(x, y, 20,91) —C= YF (a9; Jat s+ 


lei 


3 
hi = D(x, y, zi, qi) —¢ = be + (x + YAS +..., (3.24) 
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since in the first approximation the functions @® (x, y, z. q,)) and Q, (x, y, 2, 4) 
differ only by the term 8 (2 +y) AS. Substituting the values of p, and yp, in 


(3.23) we obtain a linear equation connecting Ac with A,S. A second similar 
equation is obtained by calculating, in the first approximation, the quantities 


AeS = p(t? + nt)de = NN (5a), * + 9) (Wi — Mo) dx dy. (3.25) 
Equations (3.23) and (3.25), in conjunction with (3.24), yield Ac and A,S 
as functions of g. We thus obtain 


iW = —-7 ell (35), — mid de dy + 35, [(AaS)* + 2015-428] +... 


By formula (3.20), AW is represented as a quadratic form in the 
variables q,,...,9n-;. The conditions for positiveness of the latter 
are conditions for a minimum of W for a solid body with a liquid-filled 
cavity in a field of external forces with potential energy V. 

If AW is not definite, W does not have a minimum. 

Setting & = 0 in these formulas, we obtain /23/ the solution of the problem 
of minimum potential energy V for a solid body with a liquid. We mention 
that theorems analogous to 3.1 and 3.2, referring only to the equilibrium of 
a system subject to a gravitational force, were first established inthe linear 
formulation by N.N. Moiseev /25/. 

Example 3 /24/. Ina uniform gravitational field, consider a solid 
body with a fixed point, having a cavity partially filled with liquid. Let us 
investigate the stability of uniform rotation of the whole system as a single 
rigid body about the principal central axis of inertia z, which coincides 
with the vertical &. 

We assume the region Q to be an annulus bounded by circles with radii 
R, and R,(R, >R,). The conditions for minimum W reduce in this case to 
the single inequality 


(Co— Ao) o* — Mgzo—a > 0 (Ao > Bo), 
where 


a= npg j [> ($A-<) + i} pate 
\ 


For other examples, see /23, 24, 26/. 
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S.N. Shimanov 
STABILITY OF SYSTEMS WITH DELAY 


This paper is a review of the literature on the stability of motion of 
systems described by differential equations with aftereffect or time delay. 
Our discussion is centered around generalizations of the well-known 
classical theory of Lyapunov stability for systems described by ordinary 
differential equations /1, 2, 3, 4/. To this category belong investigations by 
means of functionals, the general approach to the analysis of delay equations 
in function spaces, stability in the first approximation, the theory of the 
critical cases, and stability theory of linear and nonlinear periodic systems. 
We refer in the main to published work of Soviet and non-Soviet authors and 
to some new results of a general character in stability theory of periodic 
systems. | 

Interest in the development of the theory of systems with delay, parti- 
cularly stability theory of such systems, stems not only from the desire 
to generalize Lyapunov's method to increasingly more general objects of 
investigation, but also from practical needs. It is well-known that automatic 
control systems contain units with time delay. The delay may be in the 
controlled object, in the controller and in the feedback. Delay exists in 
servomechanisms with long communication lines. Finally, delay differential 
equations are convenient tools for mechanical and electrical systems with 
distributed elements or elements with many degrees of freedom. Recently, 
delay differential equations have been applied in control theory, when an 
uncontrolled system of ordinary differential equations becomes controllable 
if it is capable of storing information over some time interval; the motion 
of the system is then described by a system with aftereffect. 

The development of the theory of delay systems and, in particular, of 
the theory of stability, is thus conditioned by practical needs. 


1. Methods of investigation. General approach to the problem 
Consider a general system of delay differential equations in the form 


dx(t) 2 ¢ 
=) {xj (f +0) dry (8) + Xr (41 (E+ 0), + tall + 8) (i=1,..-.2), 


jm —t 


where the integrals are Stieltjes integrals /4, p. 159/, m,, (@) are functions 
of bounded variation, X, (x, (0),..., x. (8)) are nonlinear functionals defined 
on piecewise-continuous functions (x; (6)} of @ in the interval +< 6 <0, 
and representing nonlinear perturbations. 
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More precisely, the X; satisfy Lipschitz conditions in x,: 
[Xi (25 (B)s 6 25 2% (8) — Xi (2, (8), «2 (8) | <LI x? (8) — x! (8) es 
fx (0) fe = sup (121 (8)[,..., [20 (8) |, —tT< 8<0), (1.2) 
L = Ly {§x" (8) Je + Dx (8) fa". 


where L,,a, are positive numbers, X;,(0,...,0) = 0. 

The motion x= 0 will be called the unperturbed motion of system (1.1). 
Definitions of stability, asymptotic stability and instability of unperturbed 
motion are given as usual in terms of the norm |x (®) |l-. 

It is well-known that Lyapunov's second method (the method of Lyapunov 
functions) is the basic method for solving stability problems. The first 
attempts toapply it to delay systems were unsuccessful /5/. 

A modification of the method of Lyapunov functions, in which the function 
is required to be negative definite on a more restricted set of curves 
satisfying some additional condition, turned out to be more effective /6, 7/. 
The main shortcoming of Lyapunov functions in investigations of the stability 
of delay systems is their nonuniversality, and the irreversibility of the 
stability and asymptotic stability theorems. 

The specific features of equations with time delay or of more general 
delay differential equations were first taken into account by replacing 
Lyapunov functions by Lyapunov functionals defined in some function space, 
such as C (the space of continuous functions). Applied to solutions in the 
space of continuous functions, the theorems of Lyapunov's second method 
turned out to be a universal means of investigation, due to their reversibility 
/4/. 

A number of fundamental results concerning the stability of systems with 
aftereffect were obtained by the universal method of Lyapunov "functionals" 
/4/. Delay differential equations (i.e. , equations with aftereffect) are 
functional equations, since they determine the time derivatives of the 
required quantities x, () at an instant ¢ as functions of their values not only 
at the instant ¢, but also at instants ¢ —t, preceding ¢ in the delay interval 
{¢ —+, t) (the time derivatives of the quantities x, (4 are functionals on the 
interval {x,(¢+ ®). —t*<@< 0},t being the delay). Therefore, as an 
element of the solution it is natural to take a segment of the integral 
trajectory of the delay system, and then to consider the solution itself in 
the space of continuous functions C [—r1,0], x (8) = {x,(@), f=1,...,.2, —t<q 0 & 0} 
with norm |x (6)§. = sup (|x, (8)|,-.-, [2% (I. —tT< 8 < 0). 

This approach to delay systems was suggested in /4, 8/. 

Thus, as an element of the solution of system (1.1) we shall take a 
vector-segment of the trajectory x(x (wo), %,¢+%), —t<6é<0, rather than 
the vector-function of the time x (x, (6),4,4. Calculating the first derivative 
of x (xq (8), to, ¢ + 6) with respect to the time ¢ for @<0, we find 


dx, (xo (8), fo. + 0) _ dx, (Xo (8), to, £-+ 4) : 
dt — dé ° 


At—>+0, —r<6<0. 
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The system of equations (1.1) in the functional space is therefore 
associated with an equivalent system of ''ordinary differential equations’ 
with an operator right-hand side /4, p. 162; 8/: 


sa (0) = Ax; (8) + R (x: (9)), (1.2) 
where 
x4 (8) = x(t + 0) = (x-((4+ 8), —1<0<0}, 
a) | _ coco 
Ax(®)=4, (k= 1,...,n) (1.3) 
D § 21(0)dnu (0), 6 =0 
j=1 —t 
0, —tr<et< 0 
%)) = 1.4 
es (emer @=0. (1.4) 


The fact that A is an unbounded linear operator is no obstacle to the use 
of system (1.2) as a convenient working tool. This approach to delay 
differential equations and the method of functionals has been found effective 
in investigating delay systems, and has led to considerable advances in 
stability theory for this type of system /4, 8/. 

As an example of this approach, we discuss in more detail some 
supplements to the theory of linear stationary delay equations. 


2. The theory of linear delay differential equations 


Consider a system of linear delay differential equations in the form 


=> \ x(f+)dnu (0) (5=1,.--0n), (2.1) 


lel —t 


dx, (1) 
mz 


where the integrals on the right-hand side are again Stieltjes integrals, 
nu (0) are functions of bounded variation. A particular case of system (2.1) 
is the delay system 


M2 =D) (uti (t) + buxi(t—1)) (S=1,.. 200). (2.2) 


ies} 


The latter is (2.1) with n,,; (0) = as, ny (—*) = 5,,, 1, (0) = 0, —t<CO0< 0. Linear 
delay equations are studied in detail in /9/. 

The system of "ordinary differential equations" with operator right-hand 
side equivalent to (2.1) is: 


a= = Ax:(), (2.3) 
where the operator A is defined by (1.3). For system (2.2) the operator A 
has the form Ax (6) = {dx, (0/d0 for +<q09<0, ax ()+ 6x(—1) for 0 = 0}. 
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Together with system (2.3), we consider the adjoint system /10, 11, 12/ 


dy, (0) 
—— = — A’y, (0), (2.4 
where dt (8) ) 


yr: (0) = yy + 9) = (y ¢ + 8),7 > OS O}; 


dy, (0) r>0>0 
dé 9 
— A’y (9) = — (A= 1,....A) (2.5) 


— Ff :() dna (— 4), & = 0 


j=l ¢ 


For system (2.2), the adjoint operator — A° is defined by 


dy, (8) 
dé 


— A*y (0) = , t>O>0, —a'y(0)— y(t) for o=0}. 


System (2.4) corresponds to a system of differential equations with time 
advance of the form 


“6 =— > lt + 0) dn. (— 0) (s=1,...,A), (2.6) 


=i 


~~ 


The spectrum of the operator A consists of eigenvalues 4, (j = 1, 2,3,...) — 
the roots of the characteristic equation 


A (A) =|—EA+ ( edn (8)| =0. (2.7) 


— 


The eigenvalues of the operator —A* satisfy the relation p, = —A, 
(o = 1,2,...). 
We define the scalar product of two vectors 


x (8) (—t C0<0) and yg (6) («> 6 > 0): 


a n © & 
(0), 9(0) = J 4/0) 0 — SQ [)xccory(—O + Bae] anu (0). (2.8) 
im fjma —t 0 


A direct calculation establishes the identity 
(Ax(®), y (O)) = (x(6), A°y (6)) (2.9) 
for any elements xz (6) and y (®). 
It follows that for any solution of system (2.4), proceeding in the direction 


of increasing time ¢, the expression 


(x(%), y(¢+9)) =C (2.10) 
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is a first integral of system (2.3). Here we have an analogy with the theory 
of ordinary linear differential equations with constant coefficients. 

There exists an integer a, such that the multiplicity of any root of the 
characteristic equation (2.7) does not exceed n. Let A, be a root of the 
characteristic equation (2.7). Then the equations 


(A —A,/)*x (8) = 0, (— A°—A,/)“y (8) = 0 (2.11) 


(where / is the identity operator, /x (8) = x (9)) have an equal number of 
linearly independent nontrivial solutions—the root elements «x, (6) of the 
operator A and the root elements y, (@) of the operator —A’*. The solutions 
of equations (2.11) for n, = 1 are called eigenelements, all the other root 
elements are called associated elements. Let the set of root elements of 
the operator A for the roots A,,...,Aw be x, (0),...., xm (8), and the set of 
root elements of the operator —A* corresponding to the same roots be 
y, (6), ..., yw (8)... It can be shown that these elements can always be chosen 
to satisfy the following conditions. 

If x; (8) is an eigenelement of the operator A with no associated elements, 
then 


lL j=e 


5 ae (2.12) 


(x)(®), yo (8) = { 


If the elements x, (@), x;.. (8), ..., Xjm (0) form a Jordan chain (z, (@) is an 
eigenelement, and xj,; (6), .... Xjm (8) associated elements), then a Jordan 
chain yg; (8), .... 94m (8) corresponds to the adjoint operator and we have 


1 o=k, 0K 0oRgm 
0 ok, Osc, (2.13) 


(Xen (8), Yo(#)) =9 (6<j, o>j +m); 


(Xjen (6), Yj+m-o (@)) = { 


and 
A—A,I $) =0 
( pf) x; (8) (2.14) 
(A — Ail) Xjon (®) = Xena (@) (R=1],..., 7). 
Suppose that A,,...,Ay are all the roots of the characteristic equation 
(2.7) satisfying Rea, > —a,(/=1,...,M), where a is some positive number 


(counting each A, with its multiplicity). Let us define V = NM @) functionals 


f(x ()) = (x (8), (8) P= 1... ND. (2.15) 
The conditions 
f1(x(®)) =0, (j=1,....) (2.16) 


determine a subspace JJ (a) in the space C [—+,0]). An arbitrary element 
x (8) of the space C [—+,0] can be represented in the form 


N 
x(0) = 2 Yor Xe(8) +2 (0). (2.17) 
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where 
Yo = fo(x(9)), (2.18) 
in the case (2.12), and 
Your = foom-n(X(0)) (R= 1,...,m), (2.19) 
in the case (2.13). 
Now the system of equations (2.3) may be expressed in terms of the new 


variables y,,...,yw~ and Z(#), assuming the form 


d 
st =Ay, in the case (2.12) 


dt = MYtem 
CY ;4m- 
a = *Ynma—Yuem | in the case (2.13) (2.20) 
dy 
t= hy) — Yin 
bec ay = Az,(@), 2: (9) G7 (a). (2.21) 


The spectrum of the operator A restricted to // (2) consists of the eigen- 
values 4,, o= N+1...., with the exception of the first N:A,,..., Aw. 


3. The theory of linear periodic delay systems 


Consider systems whose motion is described by delay differential 
equations with periodic.coefficients, of the form 


St = Fits (t+ 0), ---tn(t+9)) (S=I,-.s a), (3.1) 


where 


F(t. (0), - +140 (8)) = 5) [Pu (44/0) + 
fui 


K 0 
+ Di Gon (tx; (—te) + J foy(t, Bab}, (S= 1, - 4 ). 


Here 9y;, Jos; are periodic continuous functions of the time ¢ with period o, 
the functions f,,(t, §) are continuous in ¢ and — for rq §< 0, —w< t< + oo 
and periodic in ¢ with period wo, +t > 1, is the delay. 

This type of system was considered in /13, 14, 15, 16, 17/. In the space 
of continuous functions C {[— +, 0) with the same norm || (®)|lk, equations (3.1) 
are equivalent to the following system of ordinary differential equations with 
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operator right-hand side: 


Suit) = P(t) x1(0), (3.2) 
where x, (6) = x (¢+ 0) ={x, (+ 0), —t << @0< 0, s=1,..., nn}, and the operator 
P (6 is defined by 

dx, (0) 
—=—_. — 06<0 
P(t)x(0)={ 4 oe (s=1,...,a) (3.3) 


F,(t, X1(8), ---, 2a (®)), @ = 0. 


Let x (x, (0), f%, t) be a solution of system (3.1) with initial function x, (@) at 
the instant fo, x (X» (8), fo, f + 0) = x, (8). Then x; (6) = x (x, (@), t,, ¢ + 9), 
—t<t<0 defines a solution of system (3.2) with the initial function x, (6). 
For fixed ?, the element x, (6) of this solution can be regarded as the image 
of the initial element x, (6) under the mapping 


xX (@) = T (¢, to) Xo (8), (3.4) 


where T (f, t.) is a linear operator, T (¢, t) = /. 

It may be shown that the eigenvalues of a completely continuous operator 
T ((, + @, 4) are independent of f,. The spectral radius 7 of the operator 
T (t) +, t,) determines the stability or instability of the motion x=0. If 7 is 
greater than unity, the motion x= 0 of the system (3.1) is unstable. If re 
is smaller than unity, the motion x=0 is asymptotically stable. The 
analytical form of the eigenelements and the associated elements of the 
operator is obtained. The corresponding particular solutions of (3.1) are 
continuable to the whole time axis and have the same analytical form. Thus, 
the eigenvector x, (@) corresponding to the eigenvalue p, has the form 


Xo (0) = pO, (), —t<KO<0, (3.5) 


where 9, (6) is a periodic vector with period o, for @ in the interval 
(— oo, + oo). The ®, (8) have derivatives of arbitrary order with respect to 
%. 

Consider the adjoint system of differential equations with time advance 
/16, 17/ 


Pt, yi (t+ 6), ..-,yn(¢+)), (S=1,...,A), (3.6) 


where 
— Fa(t, yr(®),- «+ yn (8) = — 5} pp (4) 9 (0) — 
tema 


a A a ¢@ 
— FD gen (t + te) y (te) — 5} | fe (E—B, 8) 9 (—8)4E (8 = 1,...,0). 


{=1 om jut —t 


Here p,q,f/ are the same as in (3.1). The equivalent system of linear 
ordinary differential equations with operator right-hand side in the functional 
space C [—+, 0] is /16/: 


He =— P(t) (0), (3.7) 
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where 


y:(0) = y(t + 0) = (y, (¢ +6), t>O050, s=],...,A}, 
— P(t) = {FX tor r>0>0, Fit. y() for ¢=0}. 


Let us denote by y (y (0), f, f) a solution of system (3.6) with initial 
function y, (6),1>®>0 at the instant 4, for t<qt; then y» (6) = 4 (y, (8), tb, 
t+ %),+>6>0 is a solution of system (3.7). For fixed ¢, the element 
of this solution can be regarded as the image of the element y, (8) under the 
mapping y, (6) 


Ye (O) = T(E, to) Yo(O),  E< bo. (3.8) 


Again the eigenvalues of the operator T° (¢¢, —o, ¢,) do not depend on ¢, and 
coincide with those of the operator 7 (ft, +, /,). The structure of the eigen- 
vectors and the associated vectors of the operator T (¢, —o, 4) is identical 
with that of the analogous vectors for the operator T (4, +, f). The 
corresponding particular solutions are continuable not only to ¢<4%,, but 
also to t > tg. 

We introduce the notation 


(x(8), (0), £) = Si x(0) 9) (0) + 
fant 


£, 


a 


+BY) aE —v9-4 0) gent + Ya— 
jai (ml om1 0 
a n 0 (') 
— 3B) [S 1G + 079) ult + 8, 0) a8] 0. aa 
jm jwi —t —h 


For any particular solution y; (8) of system (3.9), continuable to ¢>¢,, the 
expression 


(x_¢(®), ye (0), ¢) = const (3.10). 


is a first integral of the system (3.2). 

Suppose that all the eigenvalues p, satisfying the condition |p,;|> e(e 
arbitrary) are simple and let their number be WN (e). The more general case 
is considered analogously /16/. Consider eigenvectors x, (6), y, (@), 
=1,...,M (e)) of the operators 7 (¢, +0, ¢,), 7 (t —o,%). Let x, (6) and 
y,; (®) correspond to the eigenvalue p,; then 


(x;(8), 9/(®), = {ce (3.11) 


Let x(8) and yj) (6) be particular solutions of systems (3.1) and (3.9) with 
initial functions x, (@) and g, (6), respectively (for t=4#,). Every element 
x (8) of the space C {—+,0] can be represented uniquely as a sum 


N — fete 
x (0) = 5) ax) ($—to +O) py, ° +2(), (3.12) 
f~ 
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where 
(=f 


a, = (x (8), yj(t—to + 9) 97" t) (=1,..., N), (3.13) 


t-t, 


(z (0), 41 (to + O)p,* ,t) =0, (f= 1,..., M) (3.14) 


Thus, one can always pass from the variable x (¢) in the system (3.2) to the 
scalar variables a,,...,ay and the variable z (6), which belongs to the 
subspace (3.14). In terms of these variables (3.2) corresponds to the system 


Sh = — Nog py - ay (j= 14+ NY; (3.15) 
AO = P(t) 24(8), 24(0) © (3.16). (3.16) 


Formula (3.12) may then be interpreted as follows. In the space C [—+r, 0] 
one always can find a periodically shifting N-dimensional basis {x, (¢ — t, + ®)x 


t—f, 
Xp,* }UV=1,....).(with period o), The coordinates a, describing the 


motion of the system in the corresponding N-dimensional subspace of 
C{—+,0) then satisfy a system of ordinary differential equations with 
constant coefficients (3.15). The component z (@) in (3.12) belongs to the 
subspace (3.14) and decreases exponentially without limit, with exponent 


B<Z loge(e< i). 


4. Stability in the first approximation and stability 
investigations in the critical cases 


Consider again system (1.1), together with the so-called first approxi- 
mation system (2.1) resulting from system (1.1) by putting X,=0. It was 
shown in /4, 18/ that if all the roots of the characteristic equation (2.7) have 
negative real parts the unperturbed motion x=0 of systems (2.1) and (1.1) 
is asymptotically stable, irrespective of the form of the nonlinear parts X, 
in (1.1). If at least one of the roots of equation (2.7) has a positive real 
part, the unperturbed motion x= 0 of systems (2.1) and (1.1) is unstable, 
irrespective of the form of the nonlinear parts X,/10/. Thus the stationary 
delay system (1.1) is completely analogous to ordinary differential equations 
as regards the problem of stability in the first approximation. The same 
is true of nonlinear periodic systems of type (1.1) with first approximation 
systems of type (3.1). The first proposition, concerning asymptotic 
stability, is proved in /4, 19/. The second (instability) is proved, as in 
/10/, on the basis of the decomposition indicated in /16/. 

When the characteristic equation may have zero or purely imaginary roots, 
we have the so-called critical case. Critical cases of one zero and a pair 
of purely imaginary roots are considered, for stationary systems of type 
(1.1), in /20/. 

Let us investigate stability in the critical case of a single zero root 
4, = 0 in more detail. It is assumed that all the other roots of equation (2.7) 


197 


SHIMANOV 


have negative real parts. The technique is a generalization of Lyapunov's 

method, which he developed for systems described by ordinary differential 
equations. In the case under consideration A (0) =0, but A’ (+0. Let 

A,; (0) denote the cofactor of the element at the intersection of the k-th row 

and the j-th column of the determined A (0) of (2.7). Let Aas, (0) #0, 

y, (8) = {Ap, 0), j = 1, .... at > 8 > 0} = const. Consider the functional 

f (x (0)) = (x (8), y; (0), and the vector x, (6) in C [—+r, 0): 


X1 (8) = (Az, 7 (0)} (Ara, (0) A’ (0))* = const (6), —rQ oC. (4.1) 
Any element x (®#) @ C [—+,0) can be represented uniquely in the form 


xX (0) = x, (0) y + 2 (0). (4.2) 


Equation (2.3) becomes 


dy /dt = 0, 


(4.3) 
_ dz (0) / dt = Az, (6), f [z-(0)) =9, 


while system (1.2) becomes: 


dy /dt = Y (y, z(6)), (4.4) 
dz, (0) / dé = Az, (0) + 2 (y, 27 (0), 0), f [ze(O)}] = 0. 


Here Y (y, z (®)) is a functional defined by Y (y, z(@)) = /{R), and the operator 


Z (y, z (6), ©) = R (x1 (8) y + 2 (6)) — x: (8) Y (y, 2(0)). (4.5) 


Consider the function 
Y (y, 0) = gy* +...,Z(y, 0, O = gi(O)y™+... (4.6) 


Let m,>m. Then if m is odd and g<0, the motion x =0 is asympto- 
tically stable. If m is even or odd, but g>0, the motion x =0 is unstable. 

The condition m, > m can be ensured by the substitution z (6) = z, + u (@, y), 
where u (6, y) is a solution of the equation 


Au(0,y)+Z(y, u (0, y), 0) =0, (4.7) 


which satisfies the condition / [u (®, y)} = 0. On the plane / [x (6)] = 0 equation 
(4.7) has a unique solution u (6, y), which is analytic with respect to y and 
differentiable with respect to ®. 

This type of transformation always ensures the condition m, >m, unless 
we have the singular case Y (y, u (6, y), 6) =90. In this case the unperturbed 
motion x=0 is stable /20/. 
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CELESTIAL BALLISTICS 


G.L. Grodzovskii, Yu. N. Ivanov, and V.V. Tokarev 


OPTIMIZATION PROBLEMS IN THE MECHANICS OF 
LOW-THRUST SPACE FLIGHT 


The development of space rocket technology is creating new branches of 
mechanics. The idea of using rocket engines for space flight, which took 
shape at the turn of the century, has stimulated the development of the 
mechanics of space flight (Meshcherskii, Tsiolkovskii, Tsander, Kosmo- 
dem' yanskii, Goddard, Hohmann, and others). This science studies the 
motion of space vehicles as bodies of variable mass in order to determine 
conditions for the delivery of a maximum payload. Here the type of 
propulsion system plays a decisive role. 

The mechanics of high-thrust space flight studies the flight of rockets 
with heat engines, the name indicating their characteristic feature —small 
weight/thrust ratio; the thrust should be of the same order as the weight 
of the rocket. 

At the turn of the century the principles of low-thrust flight (solar sail, 
electrical jet engines) were formulated. The outlook for these propulsion 
systems is determined by the small mass flow per unit thrust (for electrical 
jet engines) or the absence of mass flow (for solar sails). Recent techno- 
logical progress has raised the problem of scientific and technological 
realization of these principles, and a new branch of mechanics has 
developed—the mechanics of low-thrust space flight. 

The origin of the term "low-thrust’ is connected with the high (at the 
present stage) specific gravity of the propulsion unit; vehicles with such 
engines need a thrust which is small in comparison with their weight and 
which operates over a prolonged time. Investigation has shown that, as far 
as the payload is concerned, vehicles with low-thrust engines generally 
have advantages over others in identical space maneuvers. 

The present paper deals mainly with the optimization problem in the 
mechanics of low-thrust space flight. This consists in selecting optimal 
relations between the weight components of the vehicle and optimal control 
of the propulsion system, and determining the set of optimal flight 
trajectories. A more detailed exposition and an extensive bibliography 
can be found in the authors' survey /1/. 


1. General formulation of the optimization problem in 
flight mechanics 


In formulating the purpose of a space maneuver, the problem of its best 
realization arises. Usually it is required to deliver the maximum payload 
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for given launching weight and fixed performance time. In the sequel we 
describe a general formulation of this problem, without referring it to 
specific propulsion systems. 

We first enumerate the basic weight components of the vehicle: Gm, Gm (A 
are the initial and current weights of the propellant; Gio, G. () are the 
initial and current weights of the propulsion system (if sections of the 
engine are jettisoned during the motion); Gp is the weight of the payload; 

G. G( are the initial and current values of the total weight of the vehicle 


G = Gn + Gi + Gp (1.1) 


All the weight components not involved in the optimization conditions are 
usually included in the payload. 

Let us establish the relation between the weight characteristics and the 
parameters of motion of the system, and determine the possibility of thrust 
control. An obvious relation between the mass flow g and the current 
weight G, (f) of the propellant is 


Gn = — gq. (1.2) 


The weight G, of the propulsion system is a function of the extreme 
values u; mex and umn Of the internal control parameters u, of the engine 


Gy = Gy (Usmax, Uyain)s (1. 3) 


which control the thrust P and mass flow q: 


P=P(u,), g=q(uj); (Bsmin <4; & Uy max): (1.4) 


Relations (1.3) and (1.4) must be specified for each type of propulsion 
system. * 

The motion in a gravitational field of the center of mass of a space- 
vehicle due to the thrust can be described by a vector differential equation 


r=(P/G)gi+R, (1.5) 


where r(f) is the radius-vector in an inertial coordinate system, R (r. is 
the gravitational acceleration vector, { is a unit vector in the thrust 
direction. 

The complete system of equations describing the behavior of the vehicle 
consists of the vector equation of motion (1.5), the weight relations (1.1)— 
(1.3), and the control characteristic of the engine (1.4). 

The trajectory of the vehicle must usually start at a given point of the 
phase space (i.e., with given coordinates and velocity components) and 
reach another point in a fixed time. This requirement may not be satisfied 
uniquely (owing to the presence of free control functions w,, i). 


Formula (1.3) is called the weight formula of the engine, and (1.4) its control characteristic. The form of 
these relations depends on the type of engine. The choice of a system of independent control parameters 
depends on the restrictions characteristic of the given type of engine: if explicit restrictions are imposed 
on any contol parameter, it is appropriate to retard it as independent. 
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From the whole set of trajectories connecting two given points in the 
phase space {to, fo} and {r,, ¢,} in a fixed time T, it is required to selecta 
trajectory which corresponds to motion of the vehicle with maximum payload 
Gp for a fixed launching weight G, = Gap + Guo + Gp. 

This is the variational problem of optimal controls of the engine and 
optimal relations between the weight components. The control functions 
are the thrust direction vector and the engine parameters, which determine 
the thrust and the weight G. 

Another, equivalent, variational formulation is: It is required to 
determine a trajectory which connects two given points of the phase space 
in minimum time for given launching weight and payload. The second 
formulation may be more convenient in cases when there is no certainty as 
regards the feasibility of the maneuver in the time range under consideration, 
from the power aspect. 

These two basic variational formulations may vary according to the 
specific character of the problem, changing the criterion function and the 
boundary conditions. The latter correspond to the type of space maneuver: 
inter-orbital transfer, escape from the gravitational field of a planet, 
evolutions in the neighborhood of a planet, and so on. In such cases the 
variational formulations do not impose all the boundary conditions on the 
coordinates and velocities —only part or combination of the coordinates and 
velocities are given. The free boundary conditions are then determined 
from considerations of the optimality of the trajectory. 


2. Mechanics of solar-sail flight 


The basic unit of a propulsion system with a solar sail is a mirror- 
surface producing thrust by reflection of solar light. The first serious 
investigation of this problem was made by F.A. Tsander /2/. He showed 
that realization of interplanetary flights with the aid of solar pressure is 
possible in principle. 

The formulation of the optimization problem fora solar 
sail requires, according to the general procedure described in Section I, 
the specification of relations (1.3) and (1.4). Here the mass flow is zero 
(q = 0, Gn = 0), and the thrust produced by a flat, ideally reflecting, sail of 
area S at a distance R from the sun is 


P = po(Ro/ R)*S(m-e)*n (Po == 0.9446 - 10-8 kg /m?, Ro == 0.1495 - 10! m). (2.1) 


The only control parameter u, of this simple propulsion system with 
constant area Sis the angle between the unit vector n of the normal to the 
shaded side of the sail and the unit vector e of the direction of the solar 
rays. Variation of this angle (in the range [—*%/,x, + /,n]) changes both the 
direction and the magnitude of the thrust. 

The weight G, of the propulsion unit is determined principally by the 
weight G, of the sail itself 


G, = G, = gpSd, (2.2) 


where p is the mass density of the sail material (p ~1 —2g/cm$), 8 is the 
thickness of the sail (6 ~ 2-10°? — 10° mm). 
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Using the kinematic parameter 


Gy = PoS (Gp/ g + PS8)™" & po/ pd (2.3) 


of the acceleration imparted by a sail perpendicular to the solar rays (n = e) 
to the cosmic device at a distance R, from the sun, the weight components 
can essentially be eliminated from the optimization problem 


Gp/ Gy = 1 — ay (pd/ po). (2.4) 


The mass 6 of unit sail area depends on the quality of the construction 
and should be the smallest possible. Thus, the relative payload may be 
regarded as a function of the acceleration a. 

The equation of motion (1.5) can also be expressed in terms of a: 


f = d9(Ro/R)*(n- e)*n + R. (2.5) 


The optimization problem may then be formulated as follows: Finda 
control law of the sail orientation n (f (\n ()| =1) and a trajectory r (t) (2.5) to 
ensure minimum a, (i.e., maximum GjJ/G,, by (2.4)) for fixed time T or (by 
virtue of their mutual reciprocity) minimum T for fixed acceleration a,. 
The first formulation is less convenient, since the resulting acceleration 
a, may turn out to be greater dom: = p/p6 (corresponding to Gp>< 0. see 
(2.3)). 

Interorbital transfer. Most of the current literature on the solar 
sail investigates plane motion in the central gravitational field of the sun 
when the angle between the sail and the solar rays is constant. A number 
of exact and approximate solutions of equation (2.5) have been obtained and 
the optimum sail angle for minimum interorbital transfer time has been 
found. These analytic solutions have an essential shortcoming: the 
boundary conditions for the coordinates and velocities at both ends of the 
trajectory cannot be satisfied. 

The variational problem of an optimum program for the sail angle is 
solved numerically for given boundary conditions for interorbital transfer 
in /3, 4/. The first of these papers uses 
the gradient method of control optimization, 
while in the second the problem is reduced 
to a boundary-value problem for a system 
of differential equations (by means of the 
maximum principle), and solved by Newton's 
method. Figure 1 (curve 1 ''Earth—Mars'' 
/4/) gives the dependence of minimum 
transfer time between circular coplanar 
orbits of Earth and Mars on the initial 
acceleration (2.3). 

The escape of a solar-sail vehicle from 
the gravitational field of a planet is considered 
in /5, 6/ under the following assumptions: 


"J days 


e Qs 0.3 a5 a7 the vehicle is "parked" in an initial circular 
a, cm/sec’ orbit, the gravitational field is central, the 
flux of solar light is plane-parallel with 
1 constant intensity and direction. 
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These papers do not study the variational problem of an optimum sail- 
orientation program. The paper /5/ considers rotation of the sail with 
angular velocity equal to half the angular velocity of revolution of the 
vehicle around the planet, the initial orbit and escape trajectory being in 
the plane of the ecliptic. In /6/ the angle of the sail is chosen from the 
local-optimality condition of maximum rate of energy increment; the plane 
of the initial orbit is perpendicular to that of the ecliptic, in order to 
eliminate shaded sections of the trajectory. 

A numerical solution of the Cauchy problem for the equations of motion 
(2.5) in dimensionless form, with the corresponding sail-orientation 
programs, has shown that the time needed to reach the escape point by 
the scheme of /6/ is somewhat shorter than in the case of the two-dimen- 
sional trajectory /5/. However, this is not a final indication that accelera- 
tion in a plane perpendicular to the ecliptic is preferable, since both papers 
study nonoptimum sail-orientation programs. A comparison with inter- 
orbital transfer time shows that a considerable part of the latter is occupied 
by the time of escape from the gravitational field of a planet (compare, in 
Figure 1, the curve ''Earth—Mars' with curve 2 "escape from the Earth's 
field" /5/). 


3. Mechanics of power-limited flight 


Power limited propulsion systems consist of a power generator and 
propellant unit. The latter transforms the energy produced by the generator 
into the kinetic energy of the directed motion of the jet which produces the 
thrust. The presence of a separate generator of limited power determines 
the basic features and the designations of this category of engines. 

Examples of power-limited propulsion systems are electrical jet engines. 
Power generators mentioned in the literature are, for example, solar 
collectors, nuclear and isotope reactors; the propellant units may be 
electrical boosters of various types: electric-arc, electrostatic and 
electrodynamic. 

Formulation of the optimization problem. The basic 
parameters of power-limited propulsion systems are: g—the mass flow 
rate of the propellant, V—the exhaust velocity, P—the thrust, MN—the jet 
power. A characteristic feature of these systems is their controllability: 
the parameters g and V may be varied within wide limits. It is also 
possible to control the power supply to the engine. If the distribution of 
the parameters over the cross-section of the jet emerging from the 
propellant unit is uniform, we have the following two simple relations for 
mass flow velocity, thrust and jet power 


P=qv, N= qv". (3.1) 
Thus, only two of these parameters may be considered independent. 
The jet power WN is always limited by the maximum power VN, of the 


generator: 


0K NK Wo, (3.2) 
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where nis the coefficient of conversion of the generator power into jet 
power (the efficiency of the propellant unit). In an ideal propellant unit all 
the input power is transformed into jet power (yn = 1). In actual units losses 
occur (yn <1), the efficiency depending on the engine parameters, for 
example, on the exhaust velocity V. For actual, in contrast to ideal 
propellant units, restrictions may be imposed on the exhaust velocity or 
mass flow, apart from those on the power (3.2). 

Corresponding to the restrictions imposed, Nand V or WN and q are taken 
as the two control parameters. Then 


P=2N/V, q=2N/V3 (3.3) 
or 
P= y2Nq, q=q. (3.4) 


For an ideal propellant unit (when V and gq are unrestricted and n= 1) 
it is convenient to represent the thrust and the mass flow in terms of the 
power WN and the kinematic characteristic a—the jet acceleration: 


P=aG/g, q =a°G"/2Ng". (3.5) 


The weight G, of a power-limited engine is determined mainly by the 
weight Gy of the generator, which is proportional to the maximum power 
/7, 8/: 


G, == Gun =aNo. (3.6) 


The specific gravity a of the power generator for large values of N, 
(V, > 10? kw) is almost constant. The values discussed in the literature are 
a~1—10 kg/kw. 

Corresponding to the three combinations of the control functions (3.3)— 
(3.5), the mass flow equation (1.2) and the vector equation of motion (1.5) 
can be represented in three forms 


7 ay ah en! eee 

Go=—8iy T= VG,4+6, 46) '+R: 

5 ee V2N 

Gz = —8q7], r= G40, ee i+R; (3.7) 
; *(GatGyt+G a 

Gm = — Se ne | taal +R. 


The complete system of control functions comprises the vector-function 
i (O (li] = 1) which determines the orientation law of the thrust, the function 
N (9 — the jet power 0 < N () < nA,), the function Gy ()— the weight of the power 
generator (the latter may either remain constant along the trajectory or 
decrease (dGy/dt < 0), and one of the functions V (9, ¢7(), a(f), depending on 
the form of equations (3.7). 

The variational problem for a power-limited engine is formulated as 
follows: It is required to determine optimum controls 1 (0, Gn (0, N (. 
and a((orV (f), or q (0) which guarantee delivery of maximum payload 
in a given maneuver, for given launching weight G,, in fixed time T (or 
given payload G, in minimum time T). 
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The optimal power control is found to be extremal inallcases(N () = nAN,, 
or 0—maximum power utilization on the active sections). The choice of 
an optimum law for Gy (9 (where decrease in the weight of the power : 
generator is permitted) or an optimum value for Gy (where the weight of the 
generator is assumed constant) (the weight part of problem), cannot in 
general be separated from the choice of the other controls. However, for 
an ideally controllable engine (or under direct restrictions on the jet 
acceleration) the weight part of the problem can be solved independently 
of the choice of the control a (f) = ai which determines the trajectory. 

Ideally controllable propulsion system — optimum 
weight relations, optimum power control. 

If the weight of the power source is constant, the fifth equation of (3.7), 
the mass flow equation, can be integrated by quadrature, giving the payload 
weight in the form /9, 10/ 


G1 Go = (14 72 Gat)" —Gu/G, (3.8) 


The control function M (9 appears only in the integrand of (3.8) and not 
in the equation of motion (the sixth equation of (3.7)). If the power has a 
constant upper limit NM, = const, which is the case here, the optimum law 
for N (, delivering maximum Gp (corresponding to a minimum of the 
integral in (3.8)) is determined independently of the other control functions 
/9/: NO=No. 

Taking AN outside the integral in (3.8), the integral 


6 
J = avde (3.9) 


remaining in the expression for the payload is independent of the power N, 
and the weights Gy and G, and can be calculated if the trajectory (Mis 
known. This integral represents the trajectory part of the problem in the 
payload formula. The smaller its value, the greater the payload for fixed 
G,, No and Gy. Minimization of J is the trajectory part of the general 
problem; selection of optimum Gy is the weight part of the general problem. 
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If the weight Gy of the power generator is a linear function of the power 
N,, then, for a fixed value of the functional J, G, (Gy) is a maximum for 


Gv /Go=Y O—O, (M = (a/ 2g) J) (3.10) 


and then /9, 10/ 


Gp/ Gy = (1 — ¥ O)?. (3.11) 


The optimal weight relations Gp (®), Gz (©) and Gy (®) are represented in 
Figure 2. Note that for constant weight of the power generator, a maneuver 
for which J> 2g/a (G = 0 for ® = 1) cannot be performed. 

If the relations Gy (N,) is nonlinear, the optimal weight relations cannot 
be expressed explicitly. In this general case the equation for the optimum 
power has the form /11/ 


dGy (Ne) ;Ne 4 : 
dNe Ges VI 72g +7712) =I. eins) 


The weight of a power-limited propulsion system amounts, as Figure 2 
shows, toa considerable part of the launching weight of the vehicle. If the 
payload is small, the weight of the engine predominates toward the end of 
the motion, when most of the propellant has been consumed. It may 
therefore be expedient to jettison sections of the propulsion unit during the 
motion. 

For heat engines the problem of optimum staging is essentially that of 
optimum jettisoning of the tanks, since heat engines have low specific 
gravity. For power-limited engines the problem acquires a new formulation— 
to arrange for optimum decrease in the weight generator /12/. 

Decrease in Gy implies a proportional decrease in the maximum power 
N, () = Gw (\/a (@ = const); maximum power is used everywhere WN (f) = N, (¢). 

For n-stage power decrease, the maximum payload is /1, 12/: 


a 1. 2 ; 

Gp Go= (I VBP D (Fe) (3.13) 
where the sum £9, = @ is given, while the optimum relation between the 
@®, (the distribution over the sections) can be obtained by differentiating 
(3.13) with respect to @,(i=1,...,a). The trajectory part of the problem 
again reduces to minimizing the functional (3.9). 

Figure 3 compares the payload for constant weight of the power generator 
(an = l)andfor stepwise (n= 2) decrease in the weight of the power generator. 
By stepwise decrease in Gy the maximum value of the functional (3.9) may 
be increased toJ = 2vg (Gp= Ofor DO = a). 

For an infinite number of sections (n — oo), we have the limit case — 
continuous decrease in the weight of the power generator /13, 14/. 
Replacing N by a new control function v = N/N, =aN/Gn, DE ¥v(9 | 1, we 
rewrite the third pair of equations of (3.7) as 

Peccts (Cn + On+ Op «@ 


ic + 3 r=v, v—-al+R. (3.14) 
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To determine the optimum control functions Gy () and v (), we formulate 
a variational problem for the minimum flight time between two fixed points 
of the phase space for given G, and Gp. 

The boundary conditions for the weights are: 


Gy (0) + Gm (0) = Go— Gp, Gm(T) = 0. (3.15) 


The characteristic feature of this problem is that the intial value of the 
control function Gy is not arbitrary, but occurs in the first of the boundary 
conditions (3.15). Thus the standard algorithm of variational calculus 
cannot be used. 

Using the maximum principles the optimality of the controls v (¢) = 1, 

Gy (f) = Gu (f) + Gp Or Gy (f) = const, is established. Therefore the problem 
Splits into two parts. 

1. For a given ratio G/G,, to find the optimal control Gy (), consisting 
of the sections Gy = Gy + Gp and Gy = const, which satisfies the initial 
condition (3.15) and maximizes the functional 


GydG 


2. For a given value of % to find the optimal controls a(t) and 1 
guaranteeing minimum flight time between two fixed points of the phase 
space. * 


* Hence, for fixed controls a(t), e.g., for a = const/15/, the problem is a special case of this formulation. 


209 


GRODZOVSKI, IVANOV and TOKAREV 


The first part deals with the weight, the second with the trajectory. 
Solution of the first part of the problem yields 


Gu =const = (Y Gp / G.— Gp/ Go) Go, for 0.25< Gp/G,< 1 


0.25G, for 0.25Gp—Gp< Gum < 0.75G) — Gp 
Bis Gu + Gpfor 0< Ga <0.25G) — Gp. for 0<Gp/Gy < 0.25 


The function ® (Gp) is (see curve n= oo in Figure 3): 
= £[1—1n(4Gp/ Go)} or Gp/Go = — exp (1 — 40) (3.17) 


The jettisoned sections of the power generator, by an idea of Tsander, 
can be partially utilized in the propellant unit. Thus we have the problem 
of optimum utilization of these sections /16, 11/. 

Only part of the mass of the jettisoned sections may be converted into 
propellant: y (0 < ymex = const < 1 (the parameter ya. is assumed given). 
The above formulation of the variational problem is maintained, on the 
whole. The only addition is a new control function y(). The system of 
equations determining the behavior of the vehicle is expressed in terms of 
the mass flow (see the second pair of equations of (3.7)): 


Ga =— gq, Gu=—gqy, F=V, 


v = y (2/2) Guv(q + 19) (Gm + Gv + Gp) 214 R. 


Here Gy appears not as a control function, but as a phase coordinate. As 
a control function for the variation in the weight of the power generator the 
mass flow qy () > 0 is introduced. The boundary conditions remain as 
before (3.15). 

The maximum condition for the Hamiltonian function of the problem 
implies that here too the jet power must be maximal (v (9 = 1). The control 
y (9 is also maximal, y( = yax- There are three optimal variation laws 
for the weights Gy and G,, which can be expressed in terms of the jet 
acceleration a instead of the mass flows g and gy. Integration of the above 
equations yields the final relation between the weights Gp, Gy (0), Gw (T), Gp, 
the parameter yma and the functional ®; Gp turns out to be a monotone 
decreasing functionof®. Thus, as before, the trajectory and weight parts 
of the problem may be separated. 

The solution of the latter is shown in Figure 4 as a function Gp(®) for 
various values of yma. Clearly, an active power drop (ymex>>0) may yield 
a significant gain in payload. 

Investigation of stepwise variation of the weight of the power generator 
has shown that the limit payload (n = oo) can be approximated by a finite, 
comparatively small number of sections n(Figure 5). 

Ideally controllable propulsion system — optimal 
trajectories and programs for the jet acceleration 
vector. It was shown above that the optimization problem for an ideal 
system may be divided into two independent problems: 1) to find the 
optimum ratio between the weight of the power generator and the weight of 
the propellant, and 2) to find the optimum trajectories rf and programs 
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for the jet acceleration vector a (f), which are related by the differential 
equation of motion 


F=a+R(r,t), (FO)mry fOlmin r(T)= ry F(T) = th) (3.18) 


and minimize the functional (3.9). 
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Optimal control of a (Ain the ideal case never reaches the limit @= 0. 
Thanks to the absence of limit controls, the laws of optimal motion may be 
established within the framework of classical variational analysis. 

The optimal trajectories and optimum programs for the jet acceleration 
are found by solving a system consisting of the equations of motion (3.18) 
and the Euler-Lagrange equations 


a = (0/0r)(a- R). (3.19) 


The system (3.18) and (3.19) is of order 12 in the three-dimensional 
case. There must also be 12 boundary conditions; one form of these is 
given above (3.18). 

Usually the following three basic trajectory problems are considered: 
interplanetary transfer, evolutions in the neighborhood of a planet (rotation 
of the orbital plane, correction of the orbital elements, compensation for 
a constant force), and reaching a satellite of the planet. The problem most 
Studied at present is that of interplanetary transfer. An interplanetary 
transfer trajectory has two characteristic sections: motion in the neigh- 
borhood of the planet and motion in the field of the sun (concerning the 
boundary between these sections see /17/). Accordingly, two basic 
problems for a central gravitational field are formulated: escape from 
the gravitational field (attaining zero energy) from a given point of the 
initial orbit in a given time; transfer from a given point of the initial orbit 
to a given point of the final orbit in a given time. The missing boundary 
conditions are determined by equating the terms outside the integrals in the 
first variation of the criterion functional of the problem to zero /17, 18/: 


T 
j={ (F — R)* de. (3.20) 
0 
If the gravitational acceleration is neglected (R = 0), the system of 
equations (3.18) and (3.9) is easily integrated 
a(t) = bof + bi, (ft) = 5 bol? + bil? + bat + by. (3.21) 
The constant vectors B),...,8, are determined from the boundary 
conditions. The two basic problems for a central field correspond here to: 


1) attaining a given increment Av of the absolute value of the velocity 
in time 7 (in the case of free final coordinates) 


Sopt(t) = (Av/T) Vo/ ve, Imin = Av? /T'; (3.22) 


2) transfer in time T between two stationary points at a distance L from 
each other: 


opr(t) = (BL /T*) (1 — 2t/T), Jmin = 12L°/ T*. (3.23) 
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For plane motion in a central field (R = —&r/r’) the system (3.18), (3.19), 
in polar coordinates (r,g) can be integrated twice /9, 10, 12/: 


r=0;, Q=vU~/r, 0, =a, +u8/r—k 1, Ue = dy— U,04/F, 
G, = (1 /0,) [fa (a? + a8) + a, (03/7 —k/ r*) —2—vw,/ 7], 
Gy = (1/1) (ag0; — 2a,;V_ + v). (3.24) 


The constants 4 and v are integrals of the system. This is connected 
with the fact that the equations of motion are independent of the time # and 
the polar angle g. These parameters play the role of isoperimetricity 
constants: the parameter v corresponds to a given angular displacement 
Ag (for v=0 Ag= A®opr)> and the parameter 4 to a given time of motion T 
(ford4=0 T=T.). The system (3.24) has a singular point—a node —for 
ve =0/12/. 

The system of vector equations (3.18), (3.19) has four integrals in the 
three-dimensional case /18/ and two integrals in the two-dimensional case 
/18, 19/, analogous to v and 4. A particular solution of the system (3.18), 
(3.19) is given in /18/: 


a = fo/(1 + 7/:8¢), 8 = const. 


A number of approximate solutions of variational problems for a central 
gravitational field have been obtained. 

In the escape problem /12, 20—22/, optimum energy gain occurs fora 
nearly tangential acceleration direction, due to the smallness of the jet 
acceleration. If the initial orbit is circular, the trajectory at the start of 
the motion is a gently sloping spiral, on which the gravitational and centri- 
fugal accelerations are approximately equal v/rzkr. This makes it 
possible to integrate the equations of motion and find the optimal jet 
acceleration a, = const, a,~0. In the neighborhood of the escape point this 
solution is not valid. 

An approximate method for solving the variational equations for the 
transfer problem is given in /18, 23/. The method is based on the linear- 
ization of the equations in the neighborhood of a certain plane Keplerian 
trajectory, the transporting trajectory, which partially satisfies the 
boundary conditions (for example, it has the given coordinates at the 
beginning and end of the motion). The method is applicable when the transfer 
trajectory does not complete one revolution around the center. 

Approximate solutions for optimal three-dimensional motions of a 
satellite in the neighborhood of a planet are given in /24—27/. 

The numerical methods for the problem of optimal trajectories fall into 
two groups. In the first, the system (3.18), (3.19) (or (3.24)) is integrated, 
with the corresponding boundary conditions. The boundary-value problem 
reduces to a Cauchy problem with selection of the missing initial conditions 
(usually by the Newton method). Alternatively, direct methods may be used 
to minimize the functional (3.20). _ 

Some typical results for the escape problem /9, 10, 17, 22, 28/ are given 
in Figures 6—8. All the parameters are dimensionless; the characteristic 
distance is the radius of the initial circular orbit r,, the characteristic time 
is the period of revolution in the initial orbit, divided by 2n—7,=V ye. 
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The absolute value of the jet acceleration is almost constant along the entire 
escape trajectory. The gain in the functional due to programming of the jet 
acceleration vector is shown in Figure 8 (the dashed curve corresponds to 
a, = const, a, = 0). 


To find optimal transfer trajectories between coplanar orbits, system 
(3.24) is integrated. If the angular displacement is not given /9, 10, 17, 18, 
23/ but taken as optimal, then v=0. An example of such an Earth-Mars 
transfer is shown in Figure 9 (optimal a, (f), a, (¢) and optimal trajectory). 
Transfer with optimal angular displacement are possible only for certain 
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positions of the planets in their orbits; the optimal launching dates recur 
each synodic period. 

If a given launching date is added to the given flight time 7, or if re- 
entry is being considered, the angular displacement Ag is not optimal, but 
is chosen to satisfy one of these additional conditions; when this is done 
v + 0. 

Figure 10 /29/ describes the criterion functional for transfer between 
the circular orbit of Earth and the coplanar elliptical orbit of Mars for 
various launching dates and flight times. At the points of intersection of 
the two families of curves of equal flight time the solution of the boundary- 
value problem is not unique. At these points the time 7, the initial and 
final values of the coordinates and velocities and the functional coincide. 
Types of trajectories and optimal jet accelerations are distinguished 
(Figure 11). 
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For transfer with re-entry the optimum condition for combination of the 
trajectories ''there" and ''back'"' is the equality of the parameters A and v on 
both trajectories. For transfer between circular orbits this condition is 
satisfied for "symmetric" trajectories. Symmetric transfer with re-entry 
is probably optimal for small times. For extended times examples of 
optimal nonsymmetric flights can be given (Figures 12, 13 from /30, 31/). 
Data on transfer between circular coplanar orbits of Earth and Mars are 
presented in /32/, giving the values of the functional and the parameters 
v and 4 for each pair of values of Ag andT. This makes it possible to 
find the optimal transfer with re-entry. 
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Estimates of the influence of noncoplanar trajectories /18, 30, 31/ and 
noncentral gravitational fields /17/ on the integral transfer characteristics 
have been made. In /10/ the problem of optimum distribution of a given 
interplanetary transfer time among the different sections is considered. 
The paper /23/ contains results of systematic calculations of Earth-Mars, 
Earth-Venus and Earth-Jupiter transfer trajectories with re-entry. 
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Once the trajectory part of the problem has been solved on the basis of 
the known a (4, all the parameters of an ideally controllable power-limited 
propulsion unit can be determined: the power WN, the thrust P (f), the mass 
flow g(f, the exhaust velocity V (9; for example, for a single-stage vehicle: 


¢t 
N =(1/a)(V¥Or— ©), (1 = © (7), ©) = (2/24) \ ara); 


P(H=G(tha(t)/g, CO=A(ii+G® (1) aN)"; 
g(t) = G*(t) a* (t) /2g°N, V(t) =2Ng/G(ta(t). 


The maximum payload G, can be determined from formulas (3.10), (3.17), 
and Figures 2—5 for a known final value of the function ®, =@® (7). 

The optimization problem as applied to an ideally controllable power- 
limited propulsion system was briefly described above. The parameters 
of the propulsion system, the mass flow and exhaust velocity could be varied 
in this limit case within unlimited ranges, the only condition being the 
restriction of jet power. The maximum power was considered constant 
over the whole range of the variation of these parameters. This idealization 
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made it possible to simplify the solution of the problem considerably and 
to determine the maximum possibilities for this type of propulsion system. 

The limited length of this paper does not permit us to describe further 
investigations of the problem, as applied to real systems. We confine 
ourselves to an enumeration of the principal trends. 


Waiting 
48 days 


Mars \ 
312 days 


Mars- Earth 


18 


1. Uncontrollable propulsion system —optimum weight relations, optimal 
trajectories and thrust orientation programs /30, 31, 33—35/. 

2. Optimal trajectories with constant and limited thrust acceleration 
/18, 36—39/. 

3. Restrictions on the control characteristics of the propulsion system 
/19, 30, 31, 40/. 

4. Restricted operation time of the propulsion system /11, 33/. 

5. Restricted number of positions of the controller /41/. 

6. Optimum combination of power-limited propulsion systems and 
restricted exhaust velocity /21, 42/. 

7. Reliability considerations in optimization problems for power-limited 
propulsion systems /14, 43/. 

8. Allowance for correction in optimization problems for power-limited 
propulsion systems /11, 44/. 

9. Additional weight components of power-limited engines in the 
optimization problem /45/. : 

A description of these problems can be found in the second and third 
parts of the review /1/. The fourth part of the review describes known 
analytic solutions of the equations of dynamics, exact and approximate, 
based on the smallness of the perturbing force, and also numerical methods 
for design of optimal trajectories and optimal controls. 
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V. A. Sarychev 
GRAVITY ATTITUDE STABILIZATION OF SATELLITES 


1. General principles of the system 


An important aspect of the development of space technology is the 
construction of oriented Earth satellites. Depending on the problems to 
be solved, the attitude an artificial satellite can be stabilized either actively 
or passively (i.e., without consumption of energy). Active attitude control 
systems in projects of long duration (over a year) requiring high precision 
entail large consumption of power or fuel, and these systems are very heavy 
and complex. In many cases it is preferable to use passive systems; their 
merit is especially apparent in long-duration projects where the required 
precision of attitude stabilization is of order 1°—5°. This is the case, for 
example, for meteorological satellites, communication satellites, and many 
scientific research satellites. 

Passive stabilization systems may utilize the properties of the magnetic 
and gravitational fields, the effect of radiation pressure, atmospheric drag, 
etc. Here we shall consider attitude stabilization of artificial satellites 
with respect to the trihedron formed by the radius-vector, the transversal, 
and the binormal to the orbit, called the orbital coordinate system. The 
stabilization principle is based on the property of the Newtonian force field 
according to which a body moving in it orients itself in a definite manner. 

It is well-known /1, 2/ that a body moving in a circular orbit has stable 
positions of relative equilibrium, in which the major axis of the ellipsoid 

of inertia of the body coincides with the radius-vector and of the minor axis 
with the binormal to the orbit; the medium axis is directed along the tangent 
to the orbit. Examples of natural gravity attitude stabilization of a body 
with respect to the attracting body are the moon and Mercury. 

In a conservative Newtonian force field the amplitude of oscillations of 
a satellite around a stable equilibrium position does not decrease in time; 
consequently, the precision of the attitude stabilization is determined by the 
initial values of the angles and angular velocities of the satellite. In 
stabilization systems for artificial satellites the difficulties associated with 
the conservativeness of the field are eliminated by introducing dissipative 
forces. This may be effected by various methods, but in any case the stable 
positions of relative equilibrium become asymptotically stable. 

The nonuniqueness of the stable equilibrium positions imposes restrictions 
on the initial conditions of the satellite after its separation from the last 
stage of the carrier rocket. Before a system of gravity attitude stabilization 
can be introduced, large initial values of the satellite's angles and angular 
velocities must first be reduced by an active damping system to values which 
ensure that the satellite cannot pass from one stable equilibrium to another. 
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2. Problems of design 


In /3/ the following fundamental scheme for stabilization and damping 
of satellites is proposed (Figure 1). A stabilizer is attached to the satellite 
by means of a three-stage suspension P. The stabilizer may be any body 
with suitable moments of inertia. In construction it may consist of two 
booms, rigidly connected to each other, with weights at their ends. Besides 
the suspension P, which is kinematically equivalent to an ideal spherical 
hinge, an elastic joint is introduced between the satellite and the stabilizer. 
First, using the stabilizer the ellipsoid of inertia of the system as a 
whole may be reoriented (if necessary). Second, by utilizing the relative 
motion of the satellite and stabilizer friction may be introduced in the 
satellite-stabilizer system. The parameters of the stabilizer (boom length, 
weight, angle between booms) and the elastic joints are chosen so that in 
a stable equilibrium position of the system in a circular orbit the booms lie 
in the orbital plane and 0O,X, || O,X,, 0,Y,)|I0,¥;, where O,X,Y,2, and O,X,Y,2Z, are 
the principal central trihedra of the satellite and stabilizer. It is natural 
that, together with the damper parameters, the parameters of the stabilizer 
and elastic joints should be chosen to 
ensure, in addition to the stability 
conditions, satisfactory dynamics and 
precision of the stabilization system. 
From the constructional viewpoint, 
the most complicated element of this 
system is the three-stage suspension. 
The difficulty is due to the following 
stringent dynamic requirements: 1) 
minimum dry friction in the suspension; 
2) high precision of its structure; 3) 
small torques of the elastic joints, of 
magnitude comparable to that of the 
gravitational torques. The first two 
factors determine the precision of 
stabilization of the satellite, the third 
largely determines the duration of the 
transient process. 
In /4/, we study whether the suspen- 
sion may be simplified by reducing the 
p | number of degrees of freedom. In 
deriving the equations of motion of the 
system in question, it was found that small oscillations in the pitch angle 
(in the orbital plane) do not depend on the roll and yaw angles, while 
oscillations in the roll and yaw angles are interrelated and independent of. 
the pitch angle. Clearly, because of the interrelation between the oscilla- 
tions in the yaw and roll angles, it is sufficient to introduce damping for the 
pitch and, say, yaw angle. Oscillations in the roll angle must then be 
damped owing tothe dynamic coupling with the oscillations in the yaw angle. 
This implies simplifying the suspension design by constructing it with two 
degrees of freedom, eliminating, for example, oscillations of the stabilizer 
relative to the satellite in the roll angle. 
A suspension with a single degree of freedom is also feasible. To this 
end one must eliminate the symmetry of the scheme of the stabilization 
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system, which leads to the independence of oscillations in the yaw and roll 
in the orbital plane, and consequently requires damping oscillations in at 
least two channels. Figure 2 is a diagram of a gravity stabilization system 
with a suspension having a single degree of freedom. Figure 2a (view 
from the side of the normal to the orbital plane) illustrates the scheme in 
the equilibrium position, the plane of the booms coinciding with the orbital 
plane (the plane of the drawing) and 0,X,{|j0,X,, 0,Y,||0.Y, Figure 2b is a 
front view of the scheme. The stabilizer may oscillate relative to the 
satellite about the axis PZ’, which does not coincide with any principal axis 
of the satellite. The equations of motion for this gravitational stabilization 
system are coupled, so that damping of the relative oscillations about the 
axis PZ’ ensures asymptotic stability with respect toall the angular variables. 
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Ideas similar to those of /3, 4/ are expressed in /5, 6, 7, 8/. Paper /5/ 
gives a detailed description of a suspension scheme coupling the satellite 
to the stabilizer, and a damping device. The author suggests a stabilizer 
consisting of three mutually perpendicular tubes of different lengths 
(Figure 3). The tubes are made of initially contracted metal bands which 
unroll and assume the required form under the action of elastic forces. 
The principal (longest) tube 1 is rigidly attached to the satellite and serves 
to reorient the ellipsoid of inertia of the satellite. The two auxiliary tubes 2, 
perpendicular to the principal tube, are fastened together by a single-stage 
suspension and connected to the satellite through a viscous damper, ensuring 
asymptotic stability of the equilibrium position (3 is the orbital plane). 
Figure 4 illustrates a hermetic viscous damper. A part containing a 
toroidal cavity 1, filled with a viscous fluid 2, is fastened to the body of the 
satellite, and a magnet 3 to an auxiliary tube. The magnet holds a steel 
ball 4 between its poles, and so displacement of the magnet relative to the 
cavity causes displacement of the ball in the viscous fluid and dissipation of 
energy. This project has been named ''Vertistat'’ and is being developed 
by the astronautics division of ''General Dynamics." 


A further investigation of the ''Vertistat'' gravity attitude stabilization 
system is givenin /6/. The structural details of the self-unwinding coil, 
the suspension, and the dampers of the auxiliary tubes are considered in 
detail and the effectiveness of the damping device is experimentally verified. 
Methods for reducing the deformation of the material of the long tubular 
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booms due to heating by the sun are studied, among them suitable choice of 
tube material, special coatings, and perforation of the tubes. 

An interesting gravity attitude stabilization system for communication 
satellites has been proposed by ''Bell Telephone Laboratories" /7, 8, 10/ 
(Figure 5). The main difference between this system and that described 
above consists in the damping device. The oscillations of the satellite- 
stabilizer system are damped by hysteresis effects in permeable disks 
moving between small permanent magnets. As stated in these papers, 
hysteresis damping is preferable to other methods since it is determined 
by the oscillation amplitudes and is independent of oscillation frequency 
and temperature variations. Papers /7, 8/ contain a general description 
of the design of the system and results of tests of the damper device, and 
consider the influence of various perturbing torques. 


3. Dynamics of the system 


Papers /9—14/ study the dynamics of gravity attitude stabilization 
systems, considering the following problems: 1) derivation and analysis 
of conditions for stability of the equilibrium position; 2) rapidity of damping 
of natural oscillations; 3) influence of drag; 4) eccentricity oscillations; 
5) errors. 

A detailed analysis of the three-dimensional motion of a satellite- 
stabilizer system in circular and elliptic orbits is given in /9, 11, 14/, 
which adopt the above described scheme of a satellite-stabilizer system 
with a three-stage suspension. Necessary and sufficient conditions for 
asymptotic stability of the equilibrium position of the system are derived 
in general form for motion along a circular orbit in a drag-free medium 
/9/. The stability conditions result from the construction of a Lyapunov 
function. The same conditions may be derived, using a much more 
complicated method, from the Hurwitz inequalities. 

The conditions for stability of the equilibrium position of the system are 
as follows: 


(B, — A,) 4- (Bs — As) — M (6, — bs)? > 0, 
(C, — B;) + (Cy — Bs) + M (0, — b2)® > 0; (1) 


K3"C, (Bi — Ai) + (Bs — As) — M (6, — 62)?)] + 
+ 3((B,; — A;)(B; — As) — Mb} (B; = Ay)— Mo§ (8, — A,))> 09, 
K3"C, ((Cy — Bi) + (Ca — Ba) + M (6, — bs)*) + 
+ ((C, — By) (C2 — Bs) + Moi (Cy — By) + Mb3(C; — B,)] >0, 
3° Ci (C1 — As) + (Cs — As)) + 4 (Ci — As) (C2 — As) > 0; 


(2) 


K!)>0, KP) >0, K?’>0, 
C,+ Mb? >0, A, + Mbi>0, 8: >0, By>0, (3) 
C,C. + MOC, + MbiC,>0, AAs + MbiAs + Mb3A, > 0; 


{(B; — Ai) — Mb, (6: — 6s)) [C2 — M (0; — 64) bg) — 
— {(By— As) + M (6, — bs) ba) [C, + Mb, (6, — 53)] 0. (4) 
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Here [6,| = 0,P;|6,| = 0,P: M = M,M/(M, + M,); KM, K®, K® are the dimen- 
sionless friction coefficients in the damper; K?, K?’ and K?’ are the 
dimensionless coefficients of the elastic forces in the suspension; M,, A;, 
B,.C,, My, Ay, By and C, are the masses and principal central moments of 
satellite and stabilizer, respectively. 

The physical meaning of inequalities (1) is that the satellite-stabilizer 
system should be gravitationally stable in the position of equilibrium, i.e., 
the axis of the smallest moment of inertia of the system as a whole coincides 
with the radius-vector, the axis of the largest moment coincides with the 
binormal to the orbit. 

Conditions (2) impose a lower bound on the fixing torques and constitute 
a strengthening of (1). 

The first three conditions of (3) represent the need for friction in the 
system to ensure asymptotic stability. The remainer of (3) represents 
restrictions on the design of the satellite and stabilizer, connected with the 
damping of the initial perturbations with respect to all the angular variables. 
Condition (4) precludes the particular solution according to which the 
satellite-stabilizer system performs oscillations as a rigid body without 
dissipation of energy. 

Stability analysis for the gravity attitude stabilization system developed 
by ''Bell Telephone Laboratories" is carried out in /10/, which derives 
general conditions for asymptotic stability for two-dimensional oscillations. 
As for the stability of three-dimensional oscillations for given values of the 
system's parameters, the authors suggest using the usual determinant 
inequalities of Hurwitz. 

The rapidity of damping of the natural oscillations is considered in 
/9—13/. In /9—11/, the criterion for minimum duration of the transient 
process is that among the real parts of the roots of the characteristic 
equation, that having the smallest absolute value should be a maximum. 
These papers determine the optimal parameters and present diagrams 
describing the optimal transient process of the system for three-dimensional 
oscillations in a circular orbit. Papers /12, 13/ deal with the determination 
of the time optimal parameters of the system for two-dimensional oscilla- 
tions. In /10/ the problem of optimizing the transient process is solved by 
numerical integration. : 

Depending on the design of the satellite-stabilizer system and the altitude 
of the orbit, one must often allow for atmospheric drag /11/. We shall 
assume that 1) the atmosphere is completely entrained by Earth's rotation; 
2) the orbit is circular; 3) 0O,Y, and O,Y, are axes of symmetry of satellite 
and stabilizer, respectively; 4) the effect of the atmosphere on the body 
reduces to a drag force acting at the center of pressure in a direction 
opposite to the velocity of the center of mass of. the body. The necessary 
and sufficient conditions for asymptotic stability of the natural oscillations 
of the system in a circular orbit with allowance for atmospheric drag have 
the form: 


3 (Bi — Ay) + (Bs — As) — M (6, — by)*) + Ca (1 + 42) >0, 
((Cy — By) + (Cs — Bs) + M (6; —65)*} + Cy (x1 + 9) > 0, (5) 
(C1 — As) + (Cy— As)] > 0; 
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K3°C, {3 [(B: — As) + (Bs — As) — M (by — 2)*) + Cy (me, + %)} + 
+ 3C,x, [(B, — As) — Mb3) + 3C xs ((B, — A) — Mot) + Cixixs + 
+ 9[(B: — Ai) (Bs — As) — Moi (B; — As) — M03 (8,— A;)) >0, 
KC, (Ci — Bi) + (C2 — Ba) + M (6, — b5)*] + Ci (x1 + %2)) + (6) 
+ Cx; [(Cy— Bs) + M3) + Cra [(C1— Bi) + Mbt) + Clix, + 
+ 1(C, — By) (Cy — Ba) + Mb} (Cy — By) + Mb5(C, — B,)) >0, 
KY Cy (C1 — Ar) + (C2 — Aa) + 4 (Cr — Ar) (C2 — As) > 0; 


{Cx + 3[(B; — Ar) — Mb, (6; — 6s)]} [Ca — M (6; — bg) bg] — 


— {Cyx_ + 3((Bs— Ae) + M (b1— 52) 53)} (C1 + Mb, (6; — 8,)) + 0. (7) 
Here 
ie — Qibs — (AQ3 — MaQi)br . ae — Qabs + (MiQ2 — MaQs) bs, 
Y= Se ES Se 
C15 Ciw) 
M, = em MM, = am | Q, and Q, are the drag forces acting on satellite 


and stabilizer; (0, d,, 0) and (0, 6,, 0) are the coordinates of the centers of 
pressure of satellite and stabilizer with respect to the trihedra O,x,y,z, and 
Os%24223; © iS the angular velocity of revolution of the satellite in orbit. 
Conditions (3) are independent of the atmospheric drag. From (5) and (6) 
one easily sees that while stability of the system with respect to roll is due, 
as before, to the gravitational torque, stability with respect to pitch and 
yaw can be ensured by the aerodynamic torque, if x, and x, are taken 
sufficiently large and positive. 

Because of the entrainment of the atmosphere by Earth's rotation, forced 
oscillations with frequency w, form in the yaw and roll angles. The 
amplitude of these oscillations is proportional to the parameter 


@ 2.2 
x= — SINI 
Wo 


(where Q is the angular velocity of Earth's rotation and / is the inclination 
of the orbit) and decreases with the altitude. The maximum amplitude may 
be as much as 2—3 degrees. 

It should be noted that, in principle, the influence of atmospheric drag 
on the oscillations can be eliminated. To this end it is sufficient to make 
x, =x, = 0. The atmospheric drag then affects only the translational motion 
of the system and not the oscillatory motion. 

When inequalities (1) —(4) are satisfied in a circular orbit in a drag-free 
medium, ’the oscillations die out in time and the satellite-stabilizer system 
attains a position of stable equilibrium. In an elliptical orbit there is no 
position of equilibrium. The system performs forced (eccentricity) 
oscillations resulting from the nonuniform rotation of the orbital coordinate 
system. The eccentricity oscillations occur in the orbital plane. The 
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oscillation amplitude R, and R,, of satellite and stabilizer respectively, are 
given by the following expressions: 


6 \3 
2 ga at (tar) HOO 
(xi) ++) + (Kis 


6\2 
KO 4 —) + (Ki) 
————— 
: (xe? + =) + (Ki) 


where e is the eccentricity of the orbit, and 


a= (C,+C,)+ M (6, — 6,)*, 
6 = [C, + Mb, (6, — 6,)) (3 (Bs — Az) — Ca] — 4M, (6,C2 + 8,C,), 
¢ = (3(B, — Ai) —C,] + (3 (Bs — As) —Ca] — 4M (6, — 64)", 
d = (3(B, — Ai) — Ci) [3 (Bs — Az) — C2) — 
— 4Mb} (3 (Bs — As) — Cx] — 4M} [3 (Bi — A) —C,J, 
6 = (3 (By— Ai) — Ci) (Ca — M (6, — be) bs) — 4Mb, (6,C, + 8,C,). 


The amplitude of the eccentricity oscillations is proportional to the 
eccentricity of the orbit and depends on the inertia parameters of the system 
and the coefficients of friction and elasticity. It follows from (8) that if 

K) =0 and K{) + 6/a=0, then R,=0. Thus, the stabilizer parameters can 
be chosen so that the amplitude of eccentricity oscillations of the satellite 

in an elliptic orbit is zero. In this case the stabilizer plays the role of a 
dynamic damper. 

In /3, 9, 11/ the basic case, in which the center of the suspension 
coincides with the point of intersection of the principal dynamic axes 0,P 
and 0O,P of satellite and stabilizer, is considered. Ina circular orbit it 
corresponds to exact tracking following of the orbital coordinate system. 
However, this case cannot be realized in practice, due to errors in the 
construction of the system, namely: 1) the error in determination of the 
center of mass; 2) the error in determination of the principal dynamic 
axes; 3) the error in the zero reading of the elastic torques. 

These errors constitute random influences on the parameters of the 
system. From the viewpoint of stabilization precision, the most interesting 
deviations of the parameters from the rated values are those leading to an 
inhomogeneous system of differential equations of motion. Any small 
deviations of the parameters which do not modify the homogeneity of the 
equations cause only small variations of the transient process and the 
characteristics of steady-state motion in an elliptical orbit. 

By analysis of the precision of the stabilization system, the stabilization 
precision may be related to the errors in determination of the principal 
dynamic axes and of the position of the center of mass, and to the errors 
in the zero reading of the elastic torques. Analysis has shown that in weakly 
elliptical orbits static errors predominate. Stabilization errors connected 
with uncertainty in the parameters of the external medium or with neglected 
external effects have not been considered. 

Since the ''Vertistat'' and ''Bell Telephone Laboratories" projects are 
being developed for comparatively high orbits (including 24-hour orbits), 
the moments of inertia of the satellite must be increased in order to provide 
the necessary restoring gravitational torques. In the projects in question 
this is achieved by spacing masses on long booms (up to 100m). The booms 
are tubes fabricated from thermally treated beryllium-copper bands. For 
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systems with long tubular booms the main perturbations are caused by 
high-frequency elastic oscillations in the booms and nonuniform solar 
heating of the tube surface. An elementary examination of these problems 
is contained in /6, 7, 8/. 


4. Practical implementation of the projects 


The first experimental test of a gravity attitude stabilization system was 
carried out on the satellite ''TRAAC", launched together with the satellite 
"'Transit-IV B' by a single ' Thor-Able Star" carrier rocket on 15 November 
1961 into an orbit with A. = 935km, A, = 850km, i= 32.5°. The results of 
this experiment are not known, but they were apparently quite successful, 
since on 18 December 1962 the first serviceable navigation satellite 
''Transit-V A'' with a gravity attitude stabilization system was launched. 
This launching was unsuccessful, since the systems intended for receiving 
commands and information from ground stations failed on 19 December. 

In 1963 the satellite ''1962 22A" in the same " Transit''series was launched. 
A detailed description of this satellite and results of the launching are given 
in /14/. The satellite is shown in Figure 6. Its weight is ~ 80kg, A. = 
= 770km, hy = 720km, i= 90°, the length of the metal boom is 30m, the 
equilibrium length of the spring is 6m, the weight at the end of the spring 
is 1.75 kg. 

The boom permits elongation of the ellipsoid of inertia of the satellite, 
and the spring promotes dissipation of energy. The spring is made ofa 
beryllium-copper wire of diameter 0.2mm. To ensure good damping, the 
wire is coated with a cadmium layer 0.02 mm thick, and to prevent sublima- 
tion of the cadmium in the high vacuum a layer of gold 0.005 mm thick is 
electrolytically deposited on the cadmium surface. 

The boom is a narrow steel or copper-beryllium strip (50mm wide, 

0.05 mm thick), which can be rolled up into a tube by thermal treatment, 
so that its edges overlap by 180°. This provides the same resistance to 
bending and rigidity as a seamless tube of the same diameter and wall 
thickness. Before the satellite is put into orbit the strip of the boom is 
rolled up, and the weak damping spring is immersed in a block of solid 
sublimating material — diphenyl. 

Before separation from the rocket, the angular velocity of the satellite 
is ~200rpm. Immediately after separation the velocity of revolution 
of the satellite is reduced by a 'yo-yo'' device to ~1 rpm; a magnetic 
damper then reduces the angular velocity to almost zero, and an electro- 
magnet, which orients the axis of the satellite in the direction of the magnetic 
field (which is close to the local vertical for the selected orbit) is activated. 

The boom was extended by command from Earth when the satellite axis 
formed an angle of ~1° with the direction of the magnetic field intensity, 
and ~ 12° with the local vertical. Complete extension of a 30m boom took 
77sec. Immediately after extension of the boom the telemetric system 
showed that the diphenyl blocking device was functioning. Ten hours later, 
when the ‘diphenyl had evaporated, the damping spring began to free itself. 
The spring was released completely in 24hr. 
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To control the precision of the stabilization system, three-component 
magnetometers with resolving power +0.5° and solar direction sensors 
with resolving power +1° were used. To obtain additional information, a 
flashing lamp with a source powered by solar batteries was used as the mass 
at the end of the spring. The operating frequency is a little less than one 
flash per minute. The degree of elongation of the spring may be estimated 
from the light intensity measured at the satellite. The position of the light 
source may also be determined within the limits of a solid angle of 5°, 10°, 
15° relative to the axis of the satellite. 


For an orbiting satellite with average altitude ~ 720km, the principal 
perturbing torques are determined by the interaction of the satellite with 
the magnetic field of the Earth, solar radiation, and atmospheric drag. 
By calculation, the deviation of the axis of the satellite 22A from the 
direction of the local vertical due to the perturbing magnetic torque was 
less than 1°, and that due to atmospheric drag ~ 0.9°. Ina satellite with | 
asymmetric surface distribution relative to the center of mass, a perturbing 
torque is caused by solar radiation pressure. For the satellite 22A this 
effect yields a deviation of 0.3° from the local vertical. The influence of 
solar radiation on the stabilization precision is more important; it is due 
to the nonuniform heating of the long boom, resulting in bending and deviation 
of the satellite axis from the local vertical by ~ 5°. In addition to the steady 
constant deviation, when the satellite emerges from the shadow of the Earth 
into sunlight the boom is given a pulse which may cause oscillations of the 
boom and weight at their natural frequency. The deviation of the satellite 
axis from the local vertical due to the ellipticity of the orbit reaches ~1.15°, 
By processing telemetric data, it was established that all the elements 
of the gravity attitude stabilization system, including the magnetic devices 


231 


SARYCHEV 


for reducing the rotational energy, the magnetic orientation devices, the 
extendable boom, the damping spring, and the attitude control system were 
functioning satisfactorily. Unexpected high frequency oscillations of the 
boom were observed, probably caused by the impulsive temperature change 
in crossing the shadow boundary. This effect leads to stabilization of the 
satellite with a maximum deviation of approximately 10° from the vertical. 
The planned 20° precision of orientation of the antenna to the Earth was thus 
realized. 

The bending of the boom due to heat can be considerably reduced by 
coating it with material which reflects solar energy intensely. Additional 
coating of the boom strip also reduces bending considerably. Thus, for 
example, by coating the boom strip with silver and increasing its overlap 
angle from 90° to 180° the thermal bending is reduced by a factor of 20. 
This will probably facilitate the solution of the problem of static and dynamic 
thermal bending, which arose in the satellite ''1963 22A'', and will permit 
reduction of the deviation of the satellite axis from the local vertical to 1°. 
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SOME PROBLEMS OF ASTRODYNAMICS 


The origin and subject of astrodynamics 


Rapid progress in the investigation and conquest of space has given birth 
to a new field of mechanics —astrodynamics, or the mechanics of space 
flight. The literature dealing with various aspects of the motion of space 
vehicles is considerable and increasing steadily. 

The field is also known as ''cosmodynamics", "theory of motion of 
artificial celestial bodies'', ‘celestial ballistics,'' and so on. We adopt the 
term ''astrodynamics" as shorter and fairly wide-spread in the literature. 

Astrodynamics studies the motion of both unguided and guided space 
vehicles, develops and uses methods of orbit design, and solves other 
dynamic problems concerned with the realization of space flight. 

Formally, astrodynamics is a part of celestial mechanics as defined by 
Laplace —the science studying various mechanical motions of celestial 
bodies due to gravitational forces and others. Astrodynamics may be 
regarded as a modern branch of celestial mechanics; nevertheless, the 
term ''celestial mechanics'' may safely be applied to this science. 

Natural and artificial celestial bodies obey the same laws of motion. 
Astrodynamics therefore uses the methods and results of celestial 
mechanics extensively. At the same time, the specific features of the 
subject matter and the approach have led to consideration of new problems 
and elaboration of new methods outside the scope of traditional celestial 
mechanics. 

Classical celestial mechanics was occupied with the study of the motion 
of natural celestial bodies; however, as man began to penetrate space it 
became necessary to design motions, space flights, and orbits, with a view 
to optimum satisfaction of certain requirements. Typical of these new 
problems are the control of space vehicles in flight, orbit correction, 
maneuvering, motion of space vehicles by low thrust operating during a 
considerable part of the flight. 

The classical part of mechanics —the study of the motion of a body about 
its center of mass—has undergone an interesting development in connection 
with the rotational and liberational motion of satellites under perturbations. 
Many other dynamical problems have arisen. 

In the sequel we shall briefly consider only a few problems of astro- 
dynamics. 
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Orbit design 


As indicated above, the development of space flight entailed the elabora- 
tion of various methods of orbit design. By orbit design we mean selection 
of space orbits with a view to as complete a solution as possible of the 
basic problem — flight with the most economical use of technical means. 

In a preliminary examination, it is essential to study all the orbits 
which provide, even only in principle, a solution of the basic flight problem. 
For example, when planning a flight to the moon it is important to describe 
all feasible trajectories, to be able to determine the initial conditions 
necessary for the realization of any specific trajectory, to determine the 
kinematic and dynamic characteristics of the orbits (flight time, velocity 
of impact with the moon, required initial energy, observation conditions 
from given points on Earth, etc.). 

In the analysis one may select orbits which are optimal both as regards 
effective solution of the basic flight problem, and as regards simplicity and 
economy of realization. These requirements generally conflict, and the 
final solution is often a compromise, allowing for available technological 
posssibilities. . 

One of the most important requirements imposed on an orbit is economy 
in power consumption for launching and injection into orbit. There are two 
basic methods for injection: continuous boost and launching from a satellite 
orbit. The first is technically simpler, but in some cases its realization 
involves difficulties. The selection of the firing site is inevitably restricted, 
and the boost of a space vehicle with a given destination may require 
trajectories which are steeply inclined to the local horizon. This increases 
the losses due to overcoming gravitational force and reduces the terminal 
velocity or (for given injection conditions) the weight of the vehicle. The 
method of continuous boost restricts the range of velocity directions at the 
beginning of the orbital motion, giving preference not only to orbits with 
as low an initial velocity as possible, but also to orbits with as small an 
inclination as possible of the velocity vector to the local horizon at the end 
of the boost phase. 

In the method of launching from a satellite orbit there are no power 
restrictions on the boost direction. Any direction of the velocity vector 
is obtainable by suitable timing of the launching into an intermediate satellite 
orbit (this provides, so to speak, azimuthal sighting, thanks to the rotation 
of the intermediate orbit together with Earth in its diurnal motion) and 
Suitable timing of the launching from the satellite orbit (this provides, so 
to speak, sighting with respect to the angle of elevation, since departure 
from the satellite orbit occurs when the motion in the satellite orbit has the 
required direction). Boost of a space vehicle both when entering a satellite 
orbit and when leaving it occurs at minimum angles of inclination to the 
local horizon and makes maximum use of the capacity of the carrier rocket. 
Soviet mastery of the method of launching from satellite orbits is an 
outstanding technical achievement. The use of such a method of injection 
only restricts the magnitude of the initial velocity in the orbit, permitting 
injection of heavy vehicles and thus extending the range of possible orbits 
and facilitating the conditions for their judicious selection. 

If the satellite is first to be placed in an intermediate orbit, the most 
preferable orbits will be those with the lowest initial velocity. In the case 
of continuous boost it is important that both the velocity and the angle to 
the local horizon be as small as possible. 
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Another group of problems connected with the orbit design investigates 
the precision necessary in the selected nominal orbit and the choice of the 
correction method. If no corrections occur during the flight, the problem 
is to determine the deviation limits of the parameters at the end of the 
boost phase, so that the basic flight problem is solvable if the deviations 
do not exceed these limits. For example, if the flight destination is the 
moon, the only admissible deviations of the injection parameters are those 
for which the orbits pass through the moon. Naturally, the less restrictive 
the limitations on the region of spread of the injection parameters, the 
simpler the realization of the flight, the lower the precision required of 
the equipment effecting the escape, the smaller its weight, and the higher 
its reliability. Thus, the best orbits for a space flight are those permitting 
the largest deviations of the injection parameters. This requirement may, 
and usually does, clash with the power optimality of the orbit; this situation 
is characteristic in orbit design problems. 

It may turn out that the permissible region of initial deviations is 
excessively small and is not feasible at the present stage of technology. 
Moreover, our knowledge of celestial-mechanical constants, such as the 
solar paralax or the orbital elements of planets, may be inadequate, so 
that even ideal realization of the injection conditions does not guarantee 
success. In these cases one must use orbit correction during the flight— 
corrections of the parameters of motion, e.g., by transmitting pulses of 
Suitable magnitude and direction at certain positions in the orbit. Orbit 
correction can be performed either once or several times during the flight. 

For orbit correction the vehicle must have a correcting propulsion unit 
and a propellant reserve. The additional weight required on board for orbit 
correction depends on the magnitude of the correcting pulse or, in the case 
of several corrections, on the total pulse. The magnitude of the correcting 
pulse depends on the spread of the parameters of motion at the end of the 
boost phase and will be larger, the larger the spread. In addition, it 
depends on the position in the orbit at which the correction is carried out. 
For example, a correction carried out too close to the target may require 
an excessive velocity change and a large correcting pulse, and, consequently, 
a considerable additional weight on board the vehicle. 

When selecting orbits we must give preference to orbits in which 
correction is as simple and economical as possible. At the same time the 
problem arises of optimizing the correction, that is, of selecting the orbit 
and the correction positions in such a way that the corrections require 
minimum total pulse and minimum additional weight on board the vehicle. 

Solution of the correction problem requires precise determination of the 
actual parameters of motion during the flight, computation of their deviations 
from the nominal parameters and of the necessary correction parameters. 

Determination of orbit parameters is a classical problem of celestial 
mechanics. However, for space vehicles its solution involves a number of 
specific requirements. For example, it is often necessary to determine 
the orbit parameters as quickly as possible. The computation algorithm, 
which usually includes an iterative process, should therefore be highly 
economical and require few iterations, each performable in a short time. 
The algorithm must also be highly reliable and trouble-free, guaranteeing 
convergence if the initial approximation is chosen badly. 

The precision of the determination of the actual orbit parameters depends 
on the composition and accuracy of the measured parameters, also on the 
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location of the measured interval in the orbit and its length. As a rule, for 
given composition and accuracy of the measurements, the more precise the 
determination of the parameters of the actual motion, (and thus the more 
reliable and precise the possible orbit correction), the larger the section 

of the orbit over which the measurement is performed. Excessive prolonga- 
tion is, however, not desirable, since it may lead to too late a correction 
and an excessively large correcting pulse. 

An early correction may be more economical, but inadequate accuracy 
in determination of the orbital parameters before it is performed may result 
in inadequate correction precision and entail a repeated correction. 

The above considerations illustrate the complexity and contradictory 
nature of the problems involved in the design of an orbit measurement and 
correction system, in other words, in the design of the flight control system. 
An optimum solution would seem to be a system solving the basic flight 
problem in the simplest, most reliable, and most economical way (relative 
to the weight on board). Preference should thus be given to orbits ensuring 
the most optimal flight control. 

In orbit design one must clarify and allow for the diverse conflicting 
requirements imposed on the orbit, some of which are briefly indicated 
above; the orbit must then be subjected to a complex analysis and selected 
with an eye tomaximal satisfaction of the formulated requirements. Ina 
thorough flight analysis a great number of alternatives must be calculated. 
Nevertheless, the computational accuracy required at the initial planning 
Stage is uSually not too high. A reasonable solution is therefore to develop 
and use various approximate techniques for simple, economical and clear 
(though of limited accuracy) analysis of the relevant requirements, and to 
aim at a compromise which provides the best solutions of the problem as 
a whole. 

The last planning stage for the alternatives selected requires more 
accurate calculations, allowing for all factors affecting the flight of a space 
vehicle. These calculations usually employ numerical integration, using 
the most accurate values of the constants, in order to obtain precise values 
of the parameters of flight and injection. Since the refined calculations are 
often very laborious, the need for effective computational methods is no less 
pressing here than in the preliminary planning stage. An effective technique 
should combine the requisite accuracy with speed. Increating the techniques, 
therefore, maximum use must be made of the known factors. For example, 
the motion of a space vehicle around Earth in the latter's sphere of activity 
approximates motion along a conic section with its focus at Earth's center. 
Motion outside the sphere of activity of Earth approximates heliocentric 
motion along an unperturbed orbit,and so on. These facts open the way for 
a perfected technique of refined calculations. Thereare of course alternative 
methods. 

The methods for investigation of orbits are laregely determined by the 
character of the flight. One may distinguish between multi-revolution orbits 
and orbits with small angular distance. The former include the orbits of 
artificial Earth satellites, the moon, and the planets, which perform a large 
number of revolutions during their lifetime. The study and design of such 
orbits involves methods clarifying the evolution of the parameters of the 
osculating orbit in time, under the influence of perturbing factors such as 
the noncentrality of the gravitational field, the effect of the atmosphere, 
perturbations by other celestial bodies, the effect of radiation pressure, etc. 
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Computation of the evolution process may be treated as a problem of 
nonlinear oscillations, and extensive use of various averaging methods and 
asymptotic techniques may lead to simple and effective techniques for both 
the preliminary and the refined calculation. 

Examples of orbits with a small angular distance are flight orbits from 
Earth to the moon, from Earth to Mars, Venus, and other planets. In the 
first approximation, these orbits are arcs of conic sections, and no 
evolution problems arise here. Approximate techniques may conveniently 
be set up either by neglecting perturbations, or by allowing for them ina 
fairly crude manner. Thus, a flight orbit to Mars can be considered to 
consist of portions of conic sections —unperturbed geocentric motion in the 
activity sphere of Earth, unperturbed heliocentric motion outside the 
activity spheres of Earth and Mars, and an unperturbed conic section with 
focus at the center of Mars in the latter's sphere of activity. 

The foregoing considerations are only a few of those involved in orbit 
design. 


Investigation of the motion of space vehicles about the 
center of mass 


The motion of an unguided space vehicle about its center of mass is either 
rotational or librational. In rotational motion the work of the external 
torques during one rotation of the body is small in comparison which its 
kinetic energy; the vehicle performs an almost free Euler motion whose 
parameters vary slowly in time. The motion of unoriented satellites is 
usually rotational. 

In studying rotational motion it is often convenient to use the methods 
of perturbation theory, regarding the Euler motion as unperturbed. By 
averaging one can obtain the evolution equations describing the variation of 
the angular momentum vector and other parameters of the Euler motion. 

In this connection the simplest case is that of dynamic symmetry. 

By similar methods it has been possible, in particular, to carry out 
fairly detailed investigations of the influence of gravitational, aerodynamic, 
and magnetic torques and certain other factors on satellite motion. 

Further investigations require a more detailed determination of the 
influence of dissipative factors and a more detailed consideration of the 
general case (not assuming dynamic symmetry). 

A space vehicle may oscillate about some unperturbed position due to 
perturbations. Such motions are called librational. An example is the 
moon, which maintains an almost unchanged position (turning one face 
permanently to Earth), performing only very small oscillations —librations. 
This natural stabilization is caused by the small deviation of the ellipsoid 
of inertia of the moon from a sphere, and the appearance of restoring torques 
due to the gravitational field of Earth when deviations from the position of 
equilibrium occur. 

The same effect can be used to stabilize artificial satellites. The 
requisite satellite attitude stabilization in space can be achieved by providing 
the satellite with a special body—a stabilizer, to ensure gravitational 
Stability of the satellite-stabilizer system. In order to prevent the satellite 
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and to ensure oscillations sufficiently small in amplitude, the perturbations 
forming upon separation from the carrier must be Suppressed and damping 
introduced in the satellite-stabilizer system. Such damping can be achieved, 
for instance, by introducing relative mobility of satellite and stabilizer and 
dissipative coupling in relative displacements. 

For a correct interpretation of the readings of scientific instruments 
mounted on unoriented research satellites, it is often necessary to know the 
orientation of the satellite at the time of measurement. The required 
information is obtainable from the readings of both the research instruments 
and special orientation sensors. However, in many cases this information 
is incomplete, and direct determination of the atitude of the satellite is not 
sufficiently accurate. 

In this case considerable progress can be achieved by processing the 
available information statistically, taking the known regularities of the 
motion into account. The equations of motion can take into account the 
effect of both known and incompletely known factors, where the uncertainty 
in the latter depends on unknown parameters to be determined along with 
the parameters of motion. The resulting problem of determining the motion 
and calculating the orientation of the satellite at any time is analogous to 
that of processing trajectory measurements, and is an integral part of the 
space research. 

Apart from its direct utilitarian value, the problem of determining the 
motion about the center of mass from measurements is extremely important 
for studying the regularities of the motion, ascertaining dynamic effects, 
determining perturbing torques and their influence, and constructing a sound 
dynamic scheme of the space vehicle. 

An important field in the study of space motion, though unfortunately very 
little studied, is that studying the influence of fluid contents. 

One interesting problem concerning fluid-filled cavities is that of the 
decay and evolution of rotational motions due to dissipation caused by a 
viscous fluid. If the fluid has a free surface the motion is very complicated. 
The fairly well-developed problem of small oscillations of a heavy fluid in 
a vessel under small perturbations of the free plane surface becomes 
meaningless in conditions of weightlessness. Effects suchas surface tension 
become important. 

All these interesting problems, on the border line between hydrodynamics 
and rigid body dynamics, are still open. 

This section in no way exhausts the diversity of mechanical problems 
arising in the motion of space vehicles about the center of mass; our modest 
purpose was a brief outline of some of them. 
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